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Assistant Commissioner for Patents 



Washington, DC 20231 
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Sir: 



For your review and consideration, listed below are Article 34 Amendments regarding 
this application. Applicants submit that support for the additional claims regarding fusion 
proteins comprising an afowaZ-associated sequence are found, for example, on page 52, Ins 
11-18. Support for claims to methods of therapy wherein an atonaZ-associated sequence is 
administered to a subject in conjunction with another therapeutic nucleic acid sequence may 
be found, for example, on page 58, Ins. 14-20, page 59, In. 20-page 62, In. 2 and page 62, Ins. 
5-13. 

Applicants submit herewith an additional page for these added amendments and the 
renumbered substitute page for the Abstract. 

47. A fusion protein comprising an atowaZ-associated amino acid sequence or 
fragment thereof and a desired amino acid sequence. 

48. A nucleic acid sequence encoding the fusion protein of claim 47. 

49. A method of delivering an ato«a/-associated amino acid sequence to an 
animal, wherein the method comprises administering to the animal a nucleic acid sequence 
encoding an atonaZ-associated amino acid sequence and a nucleic acid sequence encoding an 
additional therapeutic amino acid sequence. 

25077481 .1 * (09906355 / OTA 99-47) 



50. The method of claim 20, wherein the method comprises delivering an atonal- 
associated nucleic acid sequence and a second nucleic acid sequence encoding a non-atonal- 
associated therapeutic agent. 

51. The method of claim 21, wherein the method comprises delivering an atonal- 
associated nucleic acid sequence and a second nucleic acid sequence encoding a non-atonal- 
associated therapeutic agent. 

52. The method of claim 22, wherein the method comprises delivering an atonal- 
associated nucleic acid sequence and a second nucleic acid sequence encoding a non-atonal- 
associated therapeutic agent. 

53. The method of claim 24, wherein the method comprises delivering an atonal- 
associated nucleic acid sequence and a second nucleic acid sequence encoding a non-atonal- 
associated therapeutic agent. 

54. The method of claim 26, wherein the method comprises delivering an atonal- 
associated nucleic acid sequence and a second nucleic acid sequence encoding a non-atonal- 
associated therapeutic agent. 

55. The composition of claim 40, wherein the composition further comprises an 
additional amino acid sequence or nucleic acid sequence that is not an atonal-associated 
nucleic acid sequence or amino acid sequence. 

It is believed that a $9.00 fee is due in connection with this paper; please charge the 
fee to our Deposit Account No. 06-2375 order 09906355. If any other fees are required, 
please charge them to the same account. A duplicate copy of this letter is enclosed for that 
purpose. 



Fulbright & Jaworski LX«P. 

1301 McKinney, Suite 5100 
Houston, Texas 77010-3095 
(713) 651-3735 

Dated: 28 September 2001 



Respectfully submitted, 




Melissa L. Sistrunk 
Registration No. 45,579 
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47. A fusion protein comprising an atowa/-associated amino acid sequence or 
fragment thereof and a desired amino acid sequence. 

48. A nucleic acid sequence encoding the fusion protein of claim 47. 

49. A method of delivering an atonal-associated amino acid sequence to an 
animal, wherein the method comprises administering to the animal a nucleic acid sequence 
encoding an atonal-associated amino acid sequence and a nucleic acid sequence encoding an 
additional therapeutic amino acid sequence, 

50. The method of claim 20, wherein the method comprises delivering an atonal- 
associated nucleic acid sequence and a second nucleic acid sequence encoding a non-atonal- 
associated therapeutic agent. 

51. The method of claim 21, wherein the method comprises delivering an atonal- 
associated nucleic acid sequence and a second nucleic acid sequence encoding a non-atonal- 
associated therapeutic agent. 

52. The method of claim 22, wherein the method comprises delivering an atonal- 
associated nucleic acid sequence and a second nucleic acid sequence encoding a non-atonal- 
associated therapeutic agent. 

53. The method of claim 24, wherein the method comprises delivering an atonal- 
associated nucleic acid sequence and a second nucleic acid sequence encoding a non-atonal- 
associated therapeutic agent. 

54. The method of claim 26, wherein the method comprises delivering an atonal- 
associated nucleic acid sequence and a second nucleic acid sequence encoding a non-atonal- 
associated therapeutic agent. 

55. The composition of claim 40, wherein the composition further comprises an 
additional amino acid sequence or nucleic acid sequence that is not an atonal-associaXed 
nucleic acid sequence or amino acid sequence. 
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ABSTRACT 

Compositions and methods are disclosed for the therapeutic use of an atonal- 
associated nucleic acid or amino acid sequence. Also, an animal heterozygous for an atonal- 
associated gene inactivation is also disclosed having at least one atowa/-associated nucleic 
acid sequence replaced by insertion of a heterologous nucleic acid sequence used to detect 
expression driven by an ato«#/-associated promoter sequence, wherein the inactivation of the 
atowtf /-associated nucleic acid sequence prevents expression of the atonal- associated gene. 
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COMPOSITIONS AJN BMETHODS FOR THE THERAPEUTIC-USE 
~~ O F AN A rOA^X-ASSOCIATED S E^TJ ENCEFOR DEAP^SS, 

OSTEOARTHRITIS ^ 
AND ABNORMAL CELL PROLIFERATION 



The work herein was supported by grants from the United States Government. 
The United States Government may have certain rights in the invention. 

This application claims priority to a provisional application Serial No. 60/ 1 3 7,060 
filed June 1, 1999 and to a second provisional application Serial No. 60/176,993 filed 
January 19, 2000. 

FIELD OF THE INVENTION 

The present invention relates in general to the field of genetic diagnosis and 
10 therapy and, more particularly, to the characterization and use of an afcraz/-associated 
nucleic acid or amino acid sequence, or any of its homologs or orthologs, as a therapeutic 
agent for the treatment of deafness, partial hearing loss, vestibular defects due to damage 
or loss inner ear hair cells, osteoarthritis, and abnormal cell proliferation. 

15 BACKGROUND OF THE INVENTION 

An intricate pattern of interactions within and between cells directs the sequential 
development of neurons from dividing neuroepithelial progenitor cells. Multiple 
extracellular and intracellular signals moderate this process. Among the key intracellular 
20 signals are transcription factors, which induce the expression of a cascade of genes. One 
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subclass of transcription factors, belonging to the basic helix-loop-helix (bHLH) family 
of proteins, is expressed early on when the decision to proliferate or differentiate is made. 
This function is a particularly crucial one as mutations in these genes early in 
development can wipe out entire neural structures. 
5 In Drosophila, the gene atonal (ato), which is homologous to Mathl, Math2, 

Hathl and Hath2, encodes a bHLH protein essential for the development of chordotonal 
organs (sensory organs found in the body wall, joints and antenna that function in 
proprioception, balance and audition) (Eberl, 1999; Mclver, 1985; van Staaden and 
Romer, 1 998). CHOs populate the peripheral nervous system (PNS) in the body wall and 
10 joints (thorax, abdomen, sternum, wings, legs) and antennae (Moulins, 1 976), providing 
the fly with sensory information much as touch and mechanoreceptors do in vertebrates 
(Mclver, 1985; Moulins, 1976). Boyan (Boyan, 1993) proposed that, in the course of 
evolution, different CHOs became specialized for hearing in different insects. This 
hypothesis was recently confirmed by van Staaden and Romer (1998). In Drosophila, 
15 CHOs in the Johnston organ, located in the second antennal segment, function in near 
field hearing (Dreller and Kirschner, 1993; Eberl, 1999) and negative geotaxis. 

During development ato is expressed in a cluster of progenitor cells from which 
the CHO founder cells are selected (Jarmanetal., 1993). It likely functions by regulating 
the expression of genes necessary for the specification and development of the CHO 
20 lineage; as it encodes a basic helix-loop-helix protein (bHLH) that dimerizes with the 
Daughterless protein and binds to E-box sequences, thereby activating genes (Jarman et 
aL, 1993). CHO specificity is encoded by the ato basic domain, which is required for 
DNA binding in bHLH proteins (Chien et aL, 1996; Davis et aL, 1990; Jarman and 
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Ahmed, 1998; Vaessin et al., 1990). ato is both necessary and sufficient for the 
generation of CHOs in the fly: loss of ato function leads to the loss of CHOs, while 
ectopic ato expression causes ectopic CHO formation (Jarman et al., 1993). Adult flies 
that lack atonal function are uncoordinated, do not fly, and are deficient in hearing. 
5 Overexpression of the fly atonal gene can generate new chordotonal neurons, indicating 
that atonal is both essential and sufficient for the development of this neuronal 
population. 

In vertebrates, during myogenesis and neurogenesis, cell fate specification 
requires basis helix-loop-helix (bHLH) transcription factors. Mathl (for mouse atonal 

1 0 homolog- 1 ) is such a factor, and is expressed in the hindbrain, dorsal spinal cord, external 
germinal layer of the cerebellum, gut, joints, ear and Merkel cells of the skin (which 
function as mechanoreceptors) (Akazawa et al.,1995; Ben-Arie et al., 1996; Ben-Arie 
et aL, 1997). Mice heterozygous for a targeted deletion of Mathl (Mathl +A ) are viable 
and appear normal, but Mathl null mice (Math' A ) die shortly after birth and lack 

1 5 cerebellar granule neurons. 

Mathl is one of ato's closest known homologs, with 82% amino acid similarity 
in the bHLH domain and 100% conservation of the basic domain that determines target 
recognition specificity (Ben-Arie et al., 1996; Chien et al., 1996). Mathl is transiently 
expressed in the CNS starting at embryonic day 9 (E9) in the dorsal portion of the neural 

20 tube. Mathl is also expressed in the rhombic lip of the fourth ventricle of the brain, 
where cerebellar granule cell precursors are born at El 3- 15 (Alder et al., 1996). Upon 
proliferation and differentiation, these progenitor cells migrate to form the external 
granule layer (EGL) of the cerebellar primordia (Hatten and Heintz, 1 995). Proliferating 
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EGL cells continue to express Mathl during the first three postnatal weeks, until shortly 
before they migrate to their final adult destination to generate the internal granule layer 
(IGL) of the cerebellum (Akazawa et al. 5 1995; Ben-Arie et aL, 1996). Another group 
of cells, a small population of neuronal precursors in the dorsal spinal cord, expresses 
5 Mathl during E10-E14 (Akazawa et aL, 1995; Ben-Arie et aL 5 1996). These precursor 
cells also express the LIM homeodomain proteins (LH2A and LH2B), markers of the Dl 
class of commissural interneurons (Lee et aL, 1998). Helms and Johnson (1998) reported 
that lacZ expression under the control of Mathl regulatory elements reproduced Mathl 
expression patterns in the developing cerebellum and spinal cord, and demonstrated that 
1 0 Mathl is expressed in precursors that give rise to a subpopulation of dorsal commissural 
interneurons. 

To determine the in vivo function of Mathl, the inventors generated mice 
{Mathl 1 ) lacking the MATH1 protein. This null mutation causes major cerebellar 
abnormalities: lack of granule cell proliferation and migration from the rhombic lip at 

15 E14.5, and absence of the entire EGL at birth (Ben-Arie et aL, 1997). It is not clear 
whether the agenesis of cerebellar granule neurons is due to failure of progenitor 
specification or the cells 1 inability to proliferate and/or differentiate. Neonates cannot 
breathe and die shortly afterbirth, but there are no gross defects in any cranial nerves or 
brain stem nuclei that could explain respiratory failure. 

20 The fact that Mathl is expressed in the inner ear suggests that Mathl expression 

is necessary for the development of auditory or balance organs. The inner ear initially 
forms as a thickening of the ectoderm, termed the otic placode, between rhombomeres 
5 and 6 in the hindbrain. The otic placode gives rise to neurons of the VIII th cranial 
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nerve and invaginates to become the otocyst, from which the inner ear will develop. The 
mature mammalian inner ear comprises one auditory organ, the cochlea, and five 
vestibular organs: the utricle, the saccule, and three semicircular canals. The sensory 
epithelia of these organs consist of mechanoreceptive hair cells, supporting cells and 

5 nerve endings. Hair cells serve as mechanoreceptors for transducing sound waves and 
head motion into auditory and positional information. Hair cells and supporting cells 
both arise from a common progenitor cell and proliferate and differentiate within the 
sensory epithelia, with peak mitoses between embryonic day 13 and 18 (E13-18) in mice. 
Although several genes have been implicated in the development of the inner ear, such 

10 as int2 (Mansour et al., 1993; pax2 (Torres et al., 1996; and Hmx3 (Wang et al., 1998). 
None have been shown to be required for the genesis of hair cell specifically. 

Damage to hair cells is a common cause of deafness and vestibular dysfunction, 
which are themselves prevalent diseases. Over 28 million Americans have impaired 
hearing; vestibular disorders affect about one-quarter of the general population, and half 

1 5 of our elderly. The delicate hair cells are vulnerable to disease, aging, and environmental 
trauma (i.e., antibiotics, toxins, persistent loud noise). Once these cells are destroyed, 
they cannot regenerate in mammals. Therefore, a need exists to address the problems of 
patients with congenital, chronic or acquired degenerative hearing impairment and loss 
or balance problems, and to provide compositions, methods and reagents for use in 

20 treating hearing loss and vestibular function. 

In support of the teaching of the present invention, others have demonstrated that 
Mathl, upon overexpression, induces significant production of extra hair cells in 
postnatal rat inner ears (Zheng and Gao, 2000). Briefly, although fate determination is 
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usually completed by birth for mammalian cochlear hair cells, overexpression oiMathl 
in postnatal rat cochlear explant cultures results in additional ear hair cells which derive 
from columnar epithelial cells located outside the sensory epithelium in the greater 
epithelial ridge. Furthermore, conversion of postnatal utricular supporting cells into hair 
5 cells is facilitated by Mathl expression. The ability oiMathl to permit production of 
hair cells in the ear is strong evidence in support of the claimed invention. 



SUMMARY OF THE INVENTION 
In one embodiment of the present invention there is an animal having a 

10 heterologous nucleic acid sequence replacing an allele of an <2towa/-associated nucleic 
acid sequence under conditions wherein said heterologous sequence inactivates said 
allele. In a preferred embodiment said heterologous sequence is expressed under control 
of an atona/-associated regulatory sequence. In a specific embodiment both atonal- 
associated alleles are replaced In an additional specific embodiment both atonal- 

15 associated alleles are replaced with nonidentical heterologous nucleic acid sequences. 
In an additional embodiment said animal has a detectable condition wherein said 
condition is selected from the group consisting of loss of hair cells, cerebellar granule 
neuron deficiencies, hearing impairment, imbalance, joint disease, osteoarthritis, 
abnormal proliferation of neoplastic neuroectodermal cells and formation of 

20 medulloblastoma. In another embodiment of the present invention said heterologous 
nucleic acid sequence is a reporter sequence selected from the group consisting of (3- 
galactosidase, green fluorescent protein (GFP), blue fluorescent protein (BFP), neomycin, 
kanamycin, luciferase, p-glucuronidase and chloramphenicol transferase (CAT). In 
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another specific embodiment said reporter sequence regulatable or is expressed in brain 
tissue, neural tissue, skin tissue, non-ossified cartilage cells, joint chondrocytes, Merkel 
cells, inner ear sensory epithelia and brain stem nuclei. In additional specific 
embodiments said atonaZ-associated allele is replaced with an atonal-associated nucleic 

5 acid sequence under control of a regulatable promoter sequence or a tissue-specific 
promoter sequence wherein said tissue is selected from the group consisting of brain 
tissue, neural tissue, skin tissue, non-ossified cartilage cells, joint chondrocytes, Merkel 
cells, inner ear sensory epithelia and brain stem nuclei. In additional embodiments said 
animal is a mouse, Drosophila, zebrafish, frog, rat, hamster or guinea pig. 

10 In another embodiment of the present invention is a method for screening for a 

compound in an animal, wherein said compound affects expression of an atonal- 
associated nucleic acid sequence comprising delivering said compound to said animal 
wherein said animal has at least one allele of an atfonaZ-associated nucleic acid sequence 
inactivated by insertion of a heterologous nucleic acid sequence wherein said 

15 heterologous nucleic acid sequence is under control of an atowaZ-associated regulatory 
sequence, and monitoring for a change in said expression of said atonaZ-associated 
nucleic acid sequence. In specific embodiments said compound upregulates or 
downregulates said expression of an <sfonaZ-associated nucleic acid sequence. In 
additional embodiments said animal is a mouse or Drosophila. In a specific embodiment 

20 the heterologous nucleic acid sequence is a reporter sequence. In an additional specific 
embodiment the heterologous nucleic acid sequence is selected from the group consisting 
of p-galactosidase, green fluorescent protein (GFP), blue fluorescent protein (BFP), 
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neomycin, kanamycin, luciferase, P-glucuronidase and chloramphenicol transferase 
(CAT). 

Another embodiment of the present invention is a compound which affects 
expression of an atonal-associated nucleic acid sequence. In specific embodiments said 

5 compound upregulates or downregulates expression of an atonal-associated nucleic acid 
sequence. In a specific embodiment said compound affects a detectable condition in an 
animal wherein said condition is selected from the group consisting of loss of hair cells, 
cerebellar granule neuron deficiencies, hearing impairment, an imbalance disorder, joint 
disease, osteoarthritis, abnormal proliferation of neoplastic neuroectodermal cells and 

1 0 formation of medulloblastoma. 

Another embodiment of the present invention is a method for screening for a 
compound in an animal, wherein said compound affects a detectable condition in said 
animal, comprising delivering said compound to said animal wherein at least one allele 
of an atonal-associated nucleic acid sequence in said animal is inactivated by insertion 

1 5 of a heterologous nucleic acid sequence, wherein said heterologous nucleic acid sequence 
is under the control of an atonal-associated regulatory sequence, and monitoring said 
animal for a change in the detectable condition. In a specific embodiment said detectable 
condition is selected from the group consisting of loss of hair cells, cerebellar granule 
neuron deficiencies, hearing impairment, an imbalance disorder, joint disease, 

20 osteoarthritis, abnormal proliferation of neoplastic neuroectodermal cells and formation 
of medulloblastoma. In another embodiment said delivery of said compound affects 
expression of said heterologous nucleic acid sequence. In specific embodiments said 
expression of said heterologous nucleic acid sequence is upregulated or downregulated. 
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In additional specific embodiments said animal is a mouse, Drosophila, zebrafish, frog, 
rat, hamster or guinea pig. 

Another embodiment of the present invention is a compound wherein said 
compound affects said detectable condition. In specific embodiments said compound 
5 affects expression of a heterologous nucleic acid sequence. In additional specific 
embodiments said compound upregulates or downregulates expression of a heterologous 
nucleic acid sequence. 

In other embodiments of the present invention are methods of treating an animal, 
including a human, for cerebellar granule neuron deficiencies, for promoting 
1 0 mechanoreceptive cell growth, for generating hair cells, for treating hearing impairment 
or an imbalance disorder, for treating a joint disease, for treating for an abnormal 
proliferation of cells, and for treating for a disease that is a result of loss of functional 
afo«a/-associated nucleic acid or amino acid sequence. Said methods include 
administering a therapeutically effective amount of an atonal-associated nucleic acid or 
1 5 amino acid sequence. In specific embodiments said administration is by a vector selected 
from the group consisting of an adenoviral vector, a retroviral vector, an adeno-associated 
vector, a plasmid, or any other nucleic acid based vector, a liposome, a nucleic acid, a 
peptide, a lipid, a carbohydrate and a combination thereof of said vectors. In a specific 
embodiment said vector is a non-viral vector or a viral vector. In another specific 
20 embodiment said vector is a cell. In a preferred embodiment said vector is an adenovirus 
vector comprising a cytomegalovirus IE promoter sequence and a SV40 early 
polyadenylation signal sequence. In another specific embodiment said cell is a human 
cell. In an additional specific embodiment said joint disease is osteoarthritis. In a 
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specific embodiment said atonal-associated nucleic acid or amino acid sequence is Hathl 
or Mathl. In another specific embodiment the cell contains an alteration in an atonal- 
associated nucleic acid or amino acid sequence. In an additional specific embodiment 
the amino acid sequence has at least about 80% identity to about 20 contiguous amino 
5 acid residues of SEQ ID NO:58. In an additional specific embodiment the nucleic acid 
sequence encodes a polypeptide which has at least about 80% identity to about 20 
contiguous amino acid residues of SEQ ID NO:58. 

In another embodiment of the present invention is a method for treating an animal 
for an abnormal proliferation of cells comprising altering ato«a/-associated nucleic acid 
10 or amino acid sequence levels in a celL In a specific embodiment said alteration is 
reduction or said nucleic acid or amino acid sequence contains an alteration. 

In another embodiment of the present invention is a composition comprising an 
<2fo*za/-associated amino acid sequence or nucleic acid sequence in combination with a 
delivery vehicle wherein said vehicle delivers a therapeutically effective amount of an 
15 atonal-associated nucleic acid sequence or amino acid sequence into a cell. In specific 
embodiments said vehicle is the receptor-binding domain of a bacterial toxin or any 
fusion molecule or is a protein transduction domain. In a specific embodiment said 
protein transduction domain is from the HIV TAT peptide. In a specific embodiment 
said aro«a/-associated amino acid sequence ornucleic acid sequence is Hathl or Mathl. 
20 In another embodiment of the present invention there is a composition to treat an 

organism for loss of hair cells, wherein said organism comprises a defect in an atonal- 
associated nucleic acid sequence. In a specific embodiment the defect is a mutation or 
alteration of said ato«a/-associated nucleic acid sequence. In another specific 
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embodiment the defect affects a regulatory sequence of said atonal-associated nucleic 
acid sequence. In an additional embodiment of the present invention there is a 
composition to treat an organism for loss of hair cells, wherein said organism comprises 
defect in a nucleic acid sequence which is associated with regulation of an atonal- 

5 associated nucleic acid sequence. In an additional embodiment of the present invention 
there is a composition to treat an organism for loss of hair cells, wherein said organism 
comprises a defect in an amino acid sequence which is associated with regulation of an 
atonal-associated nucleic acid sequence. 

In another embodiment of the present invention there is a composition to treat an 

10 organism for a cerebellar neuron deficiency, wherein said organism comprises a defect 
in an atonaZ-associated nucleic acid sequence. In a specific embodiment the defect is a 
mutation or alteration of said atowtfZ-associated nucleic acid sequence. In another specific 
embodiment the defect affects a regulatory sequence of said atonal-associated nucleic 
acid sequence. In an additional embodiment of the present invention there is a 

15 composition to treat an organism for a cerebellar neuron deficiency, wherein said 
organism comprises defect in a nucleic acid sequence which is associated with regulation 
of an atowaZ-associated nucleic acid sequence. In an additional embodiment of the 
present invention there is a composition to treat an organism for a cerebellar neuron 
deficiency, wherein said organism comprises a defect in an amino acid sequence which 

20 is associated with regulation of an atowaZ-associated nucleic acid sequence. 

In another embodiment of the present invention there is a composition to treat an 
organism for hearing impairment, wherein said organism comprises a defect in an atonal- 
associated nucleic acid sequence. In a specific embodiment the defect is a mutation or 
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alteration of said atonal-associated nucleic acid sequence. In another specific 
embodiment the defect affects a regulatory sequence of said atowaJ-associated nucleic 
acid sequence. In an additional embodiment of the present invention there is a 
composition to treat an organism for hearing impairment, wherein said organism 

5 comprises defect in a nucleic acid sequence which is associated with regulation of an 
ato«a/-associated nucleic acid sequence. In an additional embodiment of the present 
invention there is a composition to treat an organism for hearing impairment, wherein 
said organism comprises a defect in an amino acid sequence which is associated with 
regulation of an atonal-associated nucleic acid sequence. 

10 In another embodiment of the present invention there is a composition to treat an 

organism for imbalance, wherein said organism comprises a defect in an atonal- 
associated nucleic acid sequence. In a specific embodiment the defect is a mutation or 
alteration of said atonal-associated nucleic acid sequence. In another specific 
embodiment the defect affects a regulatory sequence of said atonal-associated nucleic 

15 acid sequence. In an additional embodiment of the present invention there is a 
composition to treat an organism for imbalance, wherein said organism comprises defect 
in a nucleic acid sequence which is associated with regulation of an atonal-associated 
nucleic acid sequence. In an additional embodiment of the present invention there is a 
composition to treat an organism for imbalance, wherein said organism comprises a 

20 defect in an amino acid sequence which is associated with regulation of an atonal- 
associated nucleic acid sequence. 

In another embodiment of the present invention there is a composition to treat an 
organism for osteoarthritis, wherein said organism comprises a defect in an atonal- 
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associated nucleic acid sequence. In a specific embodiment the defect is a mutation or 
alteration of said ato?za/-associated nucleic acid sequence. In another specific 
embodiment the defect affects a regulatory sequence of said atowa /-associated nucleic 
acid sequence. In an additional embodiment of the present invention there is a 
5 composition to treat an organism for osteoarthritis, wherein said organism comprises 
defect in a nucleic acid sequence which is associated with regulation of an atonal- 
associated nucleic acid sequence. In an additional embodiment of the present invention 
there is a composition to treat an organism for osteoarthritis, wherein said organism 
comprises a defect in an amino acid sequence which is associated with regulation of an 

1 0 <2/craz/-associated nucleic acid sequence. 

In another embodiment of the present invention there is a composition to treat an 
organism for a joint disease, wherein said organism comprises a defect in an atonal- 
associated nucleic acid sequence. In a specific embodiment the defect is a mutation or 
alteration of said atona/-associated nucleic acid sequence. In another specific 

15 embodiment the defect affects a regulatory sequence of said ato«<z/-associated nucleic 
acid sequence. In an additional embodiment of the present invention there is a 
composition to treat an organism for a joint disease, wherein said organism comprises 
defect in a nucleic acid sequence which is associated with regulation of an atonal- 
associated nucleic acid sequence. In an additional embodiment of the present invention 

20 there is a composition to treat an organism for a joint disease, wherein said organism 
comprises a defect in an amino acid sequence which is associated with regulation of an 
0fcwa/-associated nucleic acid sequence. 
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In another embodiment of the present invention there is a composition to treat an 
organism for abnormal proliferation of cells, wherein said organism comprises a defect 
in an atonaZ-associated nucleic acid sequence. In a specific embodiment the defect is a 
mutation or alteration of said aforaaZ-associated nucleic acid sequence. In another specific 
5 embodiment the defect affects a regulatory sequence of said afowaZ-associated nucleic 
acid sequence. In an additional embodiment of the present invention there is a 
composition to treat an organism for abnormal proliferation of cells, wherein said 
organism comprises defect in a nucleic acid sequence which is associated with regulation 
of an atowaZ-associated nucleic acid sequence. In an additional embodiment of the 
1 0 present invention there is a composition to treat an organism for abnormal proliferation 
of cells, wherein said organism comprises a defect in an amino acid sequence which is 
associated with regulation of an afowaZ-associated nucleic acid sequence. 

In another embodiment of the present invention there is a composition to treat an 
organism for cancer, wherein said organism comprises a defect in an atowaZ-associated 
15 nucleic acid sequence. In a specific embodiment the defect is a mutation or alteration of 
said ato/zaZ-associated nucleic acid sequence. In another specific embodiment the defect 
affects a regulatory sequence of said atowaZ-associated nucleic acid sequence. In an 
additional embodiment of the present invention there is a composition to treat an 
organism for cancer, wherein said organism comprises defect in a nucleic acid sequence 
20 which is associated with regulation of an atonal-associated nucleic acid sequence. In an 
additional embodiment of the present invention there is a composition to treat an 
organism for cancer, wherein said organism comprises a defect in an amino acid 



14 



WO 00/73764 PCT/US00/15410 

sequence which is associated with regulation of an atonal-associsted nucleic acid 
sequence. In a specific embodiment said cancer is medulloblastoma. 



Other and further objects, features and advantages would be apparent and 
5 eventually more readily understood by reading the following specification and by 
reference to the company drawing forming a part thereof, or any examples of the 
presently preferred embodiments of the invention are given for the purpose of the 
disclosure. 

10 

DESCRIPTION OF THE DRAWINGS 
Figure 1 A and IB demonstrate that the inner ear p-Gal staining (blue) of Mathl 
heterozygous embryos as described hereinabove. Figure 1 A shows the otic vesicle (OV) 
at E12.5 and Figure IB the inner ear at E14.5 of Mathl +/ ^ Gal embryos. Sensory epithelia 
15 stained positively in the cochlea (C), saccule (S), utricle (U), and semicircular canal 
ampullae (SCA). A schematic diagram of the inner ear is depicted alongside the staining 
for reference, blue indicates location of the sensory epithelia. The original 
magnifications of the images taken under the microscope were xl 00 For Figure 1 A and 
x50 for Figure IB. 

20 Figures 2 A through 2F are scanning electron micrographs of El 8.5 inner ear 

sensory epithelia in wild-type and Maihl^ Gal/ ^ Gal mice. Wild-type mice epithelia are 
shown in Figures 2A, 2C, and 2E and null mouse epithelia in Figures 2B, 2D, and 2F. 
The organ of Corti of the cochlea are shown and indicated in Figures 2A and 2B. In the 
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wild-type mouse there are three rows of outer hair cells (1, 2, 3), one row of inner hair 
cells (I), all with hair bundles (HB). The tectorial membrane (TM), an accessory 
structure of the cochlea, may be observed at the bottom. Above the sensory epithelium 
are squamous cells (SQ) with rudimentary kinocilia (RK). In null mice (Figure 2B), there 
5 are only squamous cells. Crista ampullaries of a vertical semicircular canal are depicted 
in Figures 2C and 2D. The null mouse crista is similar to the wild-type in overall shape, 
including the septum (eminentia) cruciatum (EC), but is smaller. The macula of the 
uticle is the focus of Figure 2E and 2F. Again, the macula of the null mouse is smaller 
than the wild-type. Scale bars are as follows: 10 ]um in Figures 2A and 2B, 50 |im in 
10 Figures 2C and 2D, and 100 iim in Figures 2E and 2F. 

Figures 3 A through 3F are light micrographs of semi-thin transverse sections of 
inner ear sensory epithelia in wild-type mice (Figures 3 A, 3C, and 3E) and Mathl^ Gal/ ^ Gal 
(Figures 3B, 3D, and 3F), all mice were observed at El 8.5. As observed in the cochlea 
of wild-type mice, Figure 3 A, three outer hair cells (1, 2, 3) and one inner (I) hair cell are 
15 present. Conversely, the null mouse cochlea (Figure 3B) has only squamous cells (SQ) 
in the same region. Hair cells (HC) and supporting cells (SC) are present in the wild-type 
crista ampullaris (Figure 3C) and utricular macula (Figure 3E), but only supporting cells 
are present in null mice (Figure 3D and 3F). The crista was cut obliquely, accounting for 
the multiple layers of hair cells in Figure 3C. The otolithic membrane (OM), an 
20 accessory structure of the utricle, is present in both wild-type mice (Figure 3E) and null 
mice (Figure 3F). Scale bars equal 100 in (Figures 3A and 3B); 50 \xm in (Figures 
3C and 3D); and 25 |xm in (Figures 3E and 3F). 
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Figures 4A and 4B are transmission electron micrographs of El 8.5 utricular 
macula in wild-type and Mathl^ GaU ^ Gal mice. Figure 4A shows that the hair cells (HC) 
and supporting cells (SC) are present in wild-type utricular macula. By contrast, only 
supporting cells are present in the null mouse (Figure 4B). Hair cells have hair bundles 

5 (HB) and supporting cells have microvilli (MV), Hair cells are less electron-dense and 
have more apical nuclei than supporting cells, but only the latter have secretory granules 
(SG). Some immature hair cells (IM) are evident in the wild-type, but not in the null 
mouse. The scale bar in all the figures equals lOjim. 

Figures 5 A through 5F show the Calretinin staining pattern of inner ear sensory 

10 epithelia. Sections through the utricle of E16.5 wild-type (Figures A5 and 5C) and 
Mathl^ GaV ^ Gal (Figure 5B and 5D) littermates were counterstained withpropidium iodide 
(red) for confocal microscopy. Sections were cut through the crista ampullaris of El 8.5 
wild-type (Figure 5E) and MathP- GaW ' Gal (Figure 5F) were counterstained with DAPI 
(blue) for immunofluorescent microscopy. The crista is cut at an oblique angle, which 

15 accounts for the multiple layers of hair cells in (Figure 5E). Immunostaining of 
Calretinin (green, arrows) is evident in hair cells of wild-type (Figures 5 A, 5C, and 5E) 
but not null mice (Figures 5B, 5D, and 5F). Boxed areas in Figures 5 A and 5B indicate 
the regions magnified in Figures 5C and 5D. Scale bar equals 1 00 \xm in (Figure 5 A and 
5B), 15 jam in (Figures 5C and 5D) and an original magnification of x200 in (Figures 5E 

20 and5F). 

Figure 6 A and 6B show the expression pattern of Mathl in mouse articular 
cartilage using the MaihV^ Gal heterozygote. Figure 6 A shows the staining pattern of a 
P14 mouse forelimb and demonstrates expression in all joints. Figure 6B is a 
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magnification (20X) of an elbow j oint from the same mouse that demonstrates that Math 1 
is expressed exclusively in the non-ossified articular chondrocytes. 

Figure 7A through 7C show replacement of Mathl coding region by lacZ gene. 
Figure 7 A, Top, has a map of the Mathl genomic locus. The coding region is shown as 
5 a black box. The sites of the probes used to detect the wild-type and mutant alleles are 
shown as black bars. The targeting vector is in the middle with the sites for homologous 
recombination indicated by larger Xs. In the targeted locus shown at the bottom, lacZ 
is translated under the control of Mathl regulatory elements. Figure 7B demonstrates 
Southern blot analysis of embryonic stem cells using the 3 r external probe. The upper 
1 0 band represents wt allele and the lower band the targeted mutant allele (mut) in targeted 
clones. Figure 7C demonstrates Southern blot analysis of DNA from the progeny of 
heterozygous mice demonstrating the presence of the targeted allele and absence of the 
wild-type allele in Mathl mice (asterisks). The abbreviations are as follows: (A) 

Apal; (H) HindlH; (RI) EcoRI; (S) Sail; and (X) Xbal. 
15 Figures 8 A through 8H show Mathl /lacZ expression and cerebellar phenotype 

in Mathl^^ and Mathl ^s^s* mice. Figure 8A shows Mathl/lacZ expression in the 
dorsal neural tube at E9.5 and (Figure 8B) E10.5. Figure 8C indicates a section through 
the hind brain at El 0.5 has Mathl/lacZ expression in the dorsal portion (arrows). Figure 
8D demonstrates that in a spinal cord section from E12.5 embryo, dorsal cells migrate 
20 ventrally (arrows). Figure 8E shows at E14.5 expression is observed in the EGL 
progenitors at the rhombic lip and in migrating cells that will populate the EGL. Figure 
8F demonstrates in Mathl w*>& mice, Mathl/lacZ expression is limited to a few cells 
in the rhombic lip, which is significantly reduced in size. Figure 8G shows that at P0 
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Mathl/lacZ is expressed in the EGL. Figure 8H demonstrates that the EGL is absent in 
the null mice. Original magnification for Figures 8C through 8H waslOOx. 

Figure 9A through 9G shows expression of Mathl/lacZ in the inner ear and brain 
stem and histological analysis of ventral pontine nucleus. X-gal staining of El 8.5 
5 Mathl +/b -^ utricular crista (Figure 9A) and inner ear sensory epithelia of Matht^ 1 
(Figure 9B) and Mathl h&Xib ^ (Figure 9C). The Mathl/lacZ expression in the upper hair 
cell layer of the sensory epithelia of (Figures 9A and 9B) and the characteristic calyx 
appearance (arrowhead). In the null mice X-gal staining of epithelial cells is non-specific 
in the absence of hair cells (Figure 9C). Whole-brain X-gal staining of Mathl +Psy & x 
10 (Figure 9D) and Mathl h&A/b&a (Figure 9E) at El 8.5 is demonstrated. There is positive 
staining of the pontine nucleus (arrowhead) and cerebellum (arrow) in Mathl +/b ~& x mice, 
which is lacking or greatly reduced in null mutants in both the cerebellum, and pontine 
nucleus (inset). Figures 9F and 9G show haematoxylin and eosin staining of sagittal 
sections through the pons of a wild type and null mutant (Figures 9F and 9G, 
15 respectively), showing the loss of the ventral pontine nucleus in null mutants. The 
original magnifications were as follows: (A) 400x (B & C) lOOOx, (D & E) 8x, inset in 
D & E lOOx, (F & G) lOx. 

Figures 10A through 10E demonstrate Mathl/lacZ is expressed in joint 
chondrocytes. X-gal staining of whole embryos at (Figure 10A) E12.5 and (Figure 10B) 
20 E16.5 illustrates that Mathl/lacZ is expressed in all joints (Figure 10C). Horizontal 
section through the elbow joint of El 8.5 Mathl + ^ x mouse shows that it is expressed in 
resting chondrocytes (arrow). Figure 10D shows a horizontal section through a 
humero-radial joint at P10 that has expression in the articular chondrocytes (arrowhead) 
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and resting chondrocytes (arrow). Figure 10E shows high magnification of a section 
through a wrist joint indicating Mathl/lacZ is expressed in articular chondrocytes. The 
original magnification is was follows: (C) lOx; (D) 20x; and (E) 40x. 

Figures 11A through 11L show Mathl/lacZ expression in Merkel cells. To 
5 identify the structures stained on the hairy and non-hairy skin, El 6.5 littennate embryos 
were stained as whole mounts, sectioned, and microscopically examined. Shown are 
sections through the vibrissae (Figure 11 A), foot pad at low (Figure 11B) and high 
magnification of the region marked by an arrow in B (Figure 1 0C), and hairy skin (Figure 
1 ID). In all sections the localization of the stained cells was as expected from Merkel 
1 0 cells. To look for macroscopic defects in null mice, close-up pictures were taken through 
a stereomicroscope of Mathl**** (control, panels E-H) and Mathl (null, panels 

I-L) littermate mice. Staining in null mice appeared stronger because of a dosage effect 
in the vibrissae (E, I), limb joints (F, J), and foot pads (G, K). In contrast, the staining 
intensity of null (J, L) mice was markedly weaker than that of heterozygous (F, H) mice 
15 in the touch domes associated with the hairy skin. The original magnification is was 
follows: A x200; B x50; C x400; D x500; E-G-H-I-K-L x32; F-J xl6. 

Figures 12A through 12E show lack of facZ-stained touch domes in Tabby Trace. 
Tabby/Tabby females were crossed wifkiMathr^ males, and their progeny were X-gal 
stained and gender-determined at El 6. 5 . Staining around primary vibrissae in the snout 
20 was detected in both female embryos heterozygous for the Tabby mutation (Figure 12A) 
and male embryos hemizygous for the mutation (Figure 12B). Secondary vibrissae, 
which are known to vary in number in the Tabby mutants (black arrows), were also 
stained. The staining of the touch domes was less intense in the Tabby/Xfemale (Figure 
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12D) than Mathl*^ (wt for Tabby) embryos (Figure 12C), since Tabby is a 
semidominant mutation. However, patches of stained touch domes were detected in a 
female embryo that carried a wild-type allele at the Tabby locus (Figure 12 A, red arrow, 
and 12D). In contrast, a hemizygous male completely lacked both staining and touch 

5 domes, due to the loss of hair follicles that abolishes the development of Merkel cells 
(Figures 12B and 12E). 

Figures 13 A through 13F demonstrate marker analysis of Merkel cells in wild 
type and Mathl null mice. Skin sections from Mathl^ and Mathl b -s ai/b -s al reacte d w i t h 
antibodies against MATH1 (Figures 13A and 13B), cytokeratin 18 (Figures 13C and 

10 13D), and chromogranin A (Figures 13E and 13F). Polyclonal antibodies to MATH1 
identify multiple basal nuclei in rare abdominal hair follicles of wild type (Figure 13A) 
but not mutant mice (Figure 13B). Monoclonal antibodies to cytokeratin 18 and 
chromogranin A identify Merkel cells in both wild type (Figures 13C and 13E) and 
mutant (Figures 13D and 13F) mice. The original magnification was 100X. 

15 Figures 14A through 14G show Mathl rescues the lack of chordotonal neurons 

in Drosophila ato mutant embryos. Figure 14A shows a dorsal view of the thorax of a 
wild-type fly. Note there are regular array of bristles or macrochaetae. Figure 14B 
shows a similar view of a transgenic fly in which Mathl was overexpressed using the 
UAS/GAL4 system (Brand and Perrimon, 1993). This ectopic expression leads to 

20 numerous extra bristles that are external sensory organs (another type of mechano 
receptor), not CHOs. Ectopic CHOs were produced in many other regions. Figure 14C 
shows a lateral view of two abdominal clusters containing 6 CHOs in addition to external 
sensory organs, revealed by a neuronal-specific antibody (Mab 22C10). The 5 lateral 
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CHOs form a cluster, and the sixth is dorsal to the cluster. Figure 14D shows a similar 
view of an ato mutant embryo showing lack of the CHOs. Figure 14E demonstrates 
ubiquitous expression of Math 1 induces new CHO neurons in ato mutant embryos in the 
proper location. Figure 1 4F shows in situ hybridization of whole mount third instar brain 

5 using the ato cDNA as a probe. Note expression in the developing optic lobes ("horse 
shoe" expression patterns) and two punctate clusters of cells in the middle of the brain 
lobes (arrow heads). Figure 14G shows Mathl expression in Drosophila induces CHO 
formation in normal and ectopic locations. The (+) indicates presence of CHOs and (-) 
indicates their absence. Number of (+) in the first column is used to quantify the relative 

10 increase in the number of CHOs observed when Mathl is expressed. 



DETAILED DESCRIPTION OF THE INVENTION 
It is readily apparent to one skilled in the art that various embodiments and 
modifications may be made to the invention disclosed in this Application without 
15 departing from the scope and spirit of the invention. 

The term "abnormal proliferation" as used herein is defined as any proliferation 
of any type of cell, wherein said cell is not under the constraints of normal cell cycle 
progression and wherein said proliferation may result in a tumor or any cancerous 
development. 

20 The term "alteration" as used herein is defined as any type of change or 

modification to a nucleic acid or amino acid. Said change or modification includes any 
mutation, deletion, rearrangment, addition to a nucleic acid. This includes 
posttranscriptional processing such as addition of a 5' cap, intron processing and 
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polyadenylation. Mutations can be nonsense, missense, frameshift, or could lead to a 
truncated amino acid or could alter the conformation of the amino acid. The alteration 
to a nucleic acid may be present in regulatory sequences or may affect trans-acting 
factors. Also, multiple alterations may be present. Said change or modification also 
5 includes any change to an amino acid including methylation, myristilation, acetylation, 
glycosylation, or a change to signals associated with processing of said amino acid 
including intracellular or intercellular localization signals and cleaving of extraneous 
amino acids. Said alteration may also affect degradation or folding of said protein. 

The term "ato«a/-associated" as used herein is defined as any nucleic acid 
10 sequence or amino acid sequence which is the Drosophila atonal nucleic acid sequence 
or amino acid sequence, or is any sequence which is homologous to or has significant 
sequence similarity to said nucleic acid or amino acid sequence, respectively. The 
sequence can be present in any animal including mammals and insects. As used herein, 
significant sequence similarity means similarity is greater than 25% and can occur in any 
15 region of another sequence. Examples of atonaZ-associated include but are not limited 
to Math] (mouseofowa/homologl), Cathl (chicken atonal homolog±),Hathl (human 
atonal homolog 1), and Xathl (Xenopus atonal homolog 1). Furthermore, multiple 
homologous or similar sequences may exist in an animal. 

The term "cerebellar granule neuron deficiencies" as used herein is defined as any 
20 deficiency associated with cerebellar granule neurons and can include loss of cerebellar 
granule neurons, cerebellar granule neuron precursor cells, lack of granule cell 
proliferation, lack of granule cell migration and lack of cerebellar external granule layer 
cells. 
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The term "defect" as used herein is defined as an alteration, mutation, flaw or loss 
of expression of an afo/ia/-associated sequence. A skilled artisan is aware that loss of 
expression concerns expression levels of an afonaZ-associated sequence which are not 
significant or detectable by standard means in the art. A skilled artisan is also aware that 
5 loss, or absence, of expression levels in an adult organism, such as a human, occurs 
naturally and leads to impairment of hearing over time. Thus, "defect" as used herein 
includes the natural reduction or loss of expression of an <sfo«a/-associated sequence. 

The term "delivering" as used herein is defined as bringing to a destination and 
includes administering, as for a therapeutic purpose. 
10 The term "delivery vehicle" as used herein is defined as an entity which is 

associated with transfer of another entity. Said delivery vehicle is selected from the 
group consisting of an adenoviral vector, a retroviral vector, an adeno-associated vector, 
aplasmid, a liposome, a nucleic acid, apeptide, a lipid, a carbohydrate and a combination 
thereof. 

15 The term "detectable condition" as used herein is defined as any state of health 

or status of an animal, organ or tissue characterized by specific developmental or 
pathological symptoms. Examples include but are not limited to loss of hair cells, 
cerebellar granule neuron deficiencies, hearing impairment, imbalance, joint disease, 
osteoarthritis and abnormal proliferation of cells. 

20 The term "heterologous" as used herein is defined as nucleic acid sequence which 

is of or relating to nucleic acid sequence not naturally occurring in a particular locus. In 
an alternative embodiment, the heterologous nucleic acid sequence naturally occurs in 
aparticular locus, but contains a molecular alteration compared to the naturally occurring 
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locus. For instance, a wild-type locus of an atonal-associated sequence may be used to 
replace a defective copy of the same sequence. 

The term "imbalance disorder" as used herein is defined as a medical condition 
wherein an organism has impaired balance. In a specific embodiment the impairment is 
5 due to a defect of vestibular origin. In another specific embodiment the disorder is a 
vestibular disorder of balance perception including but not limited to Meniere disease, 
vertigo and layrinthitis. 

The term "inactivated" as used herein is defined as a state in which expression of 
a nucleic acid sequence is reduced or completely eliminated. Said inactivation can occur 
10 by transfer or insertion of another nucleic acid sequence or by any means standard in the 
art to affect expression levels of a nucleic acid sequence. 

The term "precursors" as used herein is defined as progenitor cells from which 
other cells derive their origin and/or properties. 

The term "regulatable reporter sequence" as used herein is defined as any 
15 sequence which directs transcription of another sequence and which itself is under 
regulatory control by an extrinsic factor or state. Examples of extrinsic factors or states 
include but are not limited to exposure to chemicals, nucleic acids, proteins, peptides, 
lipids, carbohydrates, sugars, light, sound, hormones, touch, or tissue-specific milieu. 
Examples of regulatable reporter sequences include the GAL promoter sequence and the 
20 tetracycline promoter/transactivator sequence. 

The term "regulatory sequence" as used herein is defined as any sequence which 
controls either directly or indirectly the transcription of another sequence. Said control 
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can be either regarding the initiation or cessation of transcription or regarding quantity 
or tissue distribution of transcription. 

The term "reporter sequence" as used herein is defined as any sequence which 
demonstrates expression by a regulatory sequence. Said reporter sequence can be used 

5 as a marker in the form of an RNA or in a protein. Examples of reporter sequences are 
b-galactosidase, green fluorescent protein (GFP), blue fluorescent protein (BFP), 
neomycin, kanamycin, luciferase, b-glucuronidase and chloramphenicol transferase 
(CAT). In a specific aspect of the present invention, the presence and quantity of the 
reporter sequence product, whether it be a nucleic acid or amino acid, reflects the level 

1 0 of transcription by the promoter sequence which regulates it. 

The term "therapeutically effective" as used herein is defined as the amount of 
a compound required to improve some symptom associated with a disease. For example, 
in the treatment of hearing impairment, a compound which improves hearing to any 
degree or arrests any symptom of hearing impairment would be therapeutically effective. 

15 In the treatment of a joint disease, a compound which improves the health or movement 
of a joint to any degree or arrests any symptom of a joint disease would be therapeutically 
effective. In the treatment of abnormal proliferation of cells, a compound which reduces 
the proliferation would be therapeutically effective. In the treatment of cancer, a 
compound which reduces proliferation of the cells, reduces tumor size, reduces 

20 metastases, reduces proliferation of blood vessels to said cancer, facilitates an immune 
response against the cancer would be therapeutically effective, for example. A 
therapeutically effective amount of a compound is not required to cure a disease but will 
provide a treatment for a disease. 
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The term "vector" as used herein is defined as a biological vehicle for delivery 
of a specific entity. In a specific embodiment the entity is an ato«a/-associated nucleic 
acid. 



5 In one aspect of the present invention there are methods and reagents which 

include utilization of an atonal-associated nucleic acid or amino acid sequence for the 
therapeutic use of detectible conditions such as loss of hair cells, cerebellar granule 
neuron deficiencies, hearing impairment, an imbalance disorder, joint disease, 
osteoarthritis and abnormal proliferation cells. Thus, any homolog or ortholog of atonal 

10 (from Drosophila) including but not limited to Cathl (from chicken), Hath] (from 
human), Mathl (from mice) or Xathl (from Xenopus) may be used in the present 
invention. In a preferred embodiment these sequences are directed to treatment of an 
animal, specifically a human, for the detectable conditions stated above. It is within the 
scope of the invention to encompass any sequence which is homologous to or has 

1 5 significant sequence similarity to said nucleic acid or amino acid sequence, respectively. 
The sequence can be present in any animal including mammals and insects. As used 
herein, significant sequence similarity means similarity (identity of amino acid residues 
or nucleic acid bases) is greater than 25% and can occur in any region of the sequence. 
In another embodiment an aftwaZ-associated sequence as used herein has greater than 

20 about 50% sequence similarity, greater than about 70% similarity, or greater than about 
80% similarity. 

It is within the scope of the present invention that an atowaZ-associated nucleic 
acid sequence or amino acid sequence is utilized wherein domains important for activity, 
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such as the basic HLH region, are included in a molecule but further comprise alterations, 
mutations, deletions or substitutions in regions of the nucleic acid or amino acid sequence 
which are not part of a domain important for an activity and do not affect its function. 
Examples of afo«a/-associated include but are not limited to Math 1 (mouse atonal 
5 homolog I), Cathl (chicken atonal homolog I), Hathl (human atonal homolog 1 ), and 
Xathl {Xenopus atonal homolog 1). Such examples are represented in SEQ ID NO:l 
through SEQ ID NO:66, although others very likely exist in related organisms. A skilled 
artisan is cognizant of means to identify such sequences which have significant 
similarity, such as searching database collections of nucleic and amino acid sequence 
10 located on the World Wide Web, including 
http://www.ncbi.nlm.nih.gov/Genbank/GenbaiikSearch.html. 

The sequences provided herein and the corresponding GenBank Accession 
numbers are listed parenthetically as follows: SEQ ID NO:l (NMJ305172); SEQ ID 
NO:2 (NP_005 163.1 ); SEQIDNO:3 (AW413228); SEQ ID NO: 4 (NM_009719); SEQ 
15 IDNO:5 (NP_033849.1); SEQIDNO:6 (NM_009718); SEQ ID NO: 7 (NP.033848.1) 
SEQ ID NO:8 (NM 009717); SEQ ID NO: 9 (NP_033847.1); SEQ ID NO:10 
(NM_007500); SEQ ID NO: 1 1(NP_03 1526.1); SEQIDNO:12 (NM_007501 ); SEQ ID 
NO:13 (AW280518); SEQIDNO:14(AW236965 ); SEQIDNO:15(AW163683); SEQ 
ID NO:16 (AF134869); SEQ ID NO: 17(AAD31451.1); SEQ ID NO:18 (AJ012660); 
20 SEQ ID NO:19 (CAA10106.1); SEQ ID NO:20 (AJ012659); SEQ ID NO:21 
(CAA10105.1); SEQ ED NO:22 (AF071223); SEQ ID NO:23 (AAC68868.1); SEQ ID 
NO:24 (U76208); SEQ ID NO:25 (AAC53029.1); SEQ ID NO:26 (U76210); SEQ ID 
NO:27 (AAC53033.1); SEQ ID NO:28 (U76209); SEQ ID NO:29 (AAC53032.1); SEQ 
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IDNO:30(U76207); SEQIDNO:31 (AAC53028.1); SEQ ID NO:32 (AF036257); SEQ 
ID NO:33 (AAC15969.1); SEQ ID NO:34 (AF034778); SEQ ID NO:35 (AJ001178); 
SEQ ID NO:36 (CAA04572.1); SEQ ID NO:37 (Y07621); SEQ ID NO:38 
(CAA68900.1); SEQ ID NO:39 (AF024536); SEQ ID NO:40 (AAB82272.1); SEQ ID 
5 NO:41 (D85188); SEQ ID NO:42 (BAA12738.1); SEQ ID NO:43 (D44480); SEQ ID 
NO:44(BAA07923.1); SEQ ID NO:45 (D43694); SEQ ID NO:46 (BAA07791.1); SEQ 
IDNO:47(D85845);SEQIDNO:48(BAA12880.1); SEQIDNO:49(U93171); SEQ ID 
NO:50 (AAB58669.1); SEQ IDNO:51 (U93170); SEQIDNO:52 (AAB58668.1); SEQ 
ID NO:53 (U61 1 52); SEQ ID NO:54 (AAB41 307. 1); SEQ ID NO:55 (U61 1 5 1); SEQ ID 
10 NO:56 (AAB41306.1); SEQ ID NO:57 (U61 148); SEQ IDNO:58 (AAB41305.1); SEQ 
IDNO:59 (U61 149); SEQ ID NO:60 (AAB41304. 1); SEQ ID NO:61 (U6 1 150); SEQ ID 
NO:62 (AAB41303.1); SEQ IDNO:63 (L36646); SEQIDNO:64(AAA21879.1); SEQ 
ID NO:69 (AA625732). 

In an aspect of the invention there is an animal having a heterologous nucleic acid 
15 sequence replacing an allele of an atonaZ-associated nucleic acid sequence under 
conditions wherein said heterologous sequence inactivates said allele. In an alternative 
embodiment a heterologous sequence is delivered to a cell for extrachromosomal 
propagation. In another alternative embodiment a heterologous sequence is integrated 
into the chromosome of a cell in a locus other than the locus of an atona Z-associated 
20 nucleic acid sequence. In a preferred embodiment said heterologous sequence is 
expressed under control of an afoMaZ-associated regulatory sequence. In a specific 
embodiment both afonaZ-associated alleles are replaced. In an additional specific 
embodiment both atorazZ-associated alleles are replaced withnonidentical heterologous 
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nucleic acid sequences. Methods to generate transgenic animals are well known in the 
art, and a skilled artisan would refer to such references as Transgenic Animals by 
Grosveld and Kollias (eds.) or Mouse Genetics and Transgenics : A Practical Approach 
by Jackson et al. (eds.). 

5 

In an additional embodiment a transgenic animal of the present invention has a 
detectable condition wherein said condition is selected from the group consisting of loss 
ofhair cells, cerebellar granule neuron deficiencies, hearing impairment, imbalance, joint 
disease, osteoarthritis and abnormal proliferation of cells. In another embodiment of the 

10 present invention a heterologous nucleic acid sequence is a reporter sequence selected 
from the group consisting of p-galactosidase, green fluorescent protein (GFP), blue 
fluorescent protein (BFP), neomycin, kanamycin, luciferase, (3-glucuronidase and 
chloramphenicol transferase (CAT). In another specific embodiment, a reporter sequence 
is regulatable or is expressed in brain tissue, neural tissue, skin tissue, non-ossified 

1 5 cartilage cells, j oint chondrocytes, Merkel cells, inner ear epithelial cells and brain stem 
nuclei. In additional specific embodiments said atona/-associated allele is replaced with 
an ato/z#/-associated nucleic acid sequence under control of a regulatable promoter 
sequence or a tissue-specific promoter sequence wherein said tissue is selected from the 
group consisting of brain tissue, neural tissue, skin tissue, non-ossified cartilage cells, 

20 joint chondrocytes, Merkel cells, inner ear epithelial cells and brain stem nuclei. In 
additional embodiments a transgenic animal is a mouse, Drosophila, frog, zebrafish, rat, 
guinea pig, or hamster. 



30 



WO 00/73764 PCT/US00/15410 

In another embodiment of the present invention is a method for screening for a 
compound in an animal, wherein said compound affects expression of an atonal- 
associated nucleic acid sequence comprising delivering said compound to said animal 
wherein said animal has at least one allele of an atonal-associated nucleic acid sequence 
5 inactivated by insertion of a heterologous nucleic acid sequence wherein said 
heterologous nucleic acid sequence is under control of an atona/-associated regulatory 
sequence, and monitoring for a change in said expression of said atonal-associated 
nucleic acid sequence. Examples of regulatory sequences may include promoter 
sequences, enhancers or silencers. 
L 0 hi specific embodiments there is a compound which upregulates or do wnregulates 

said expression of an a?ona/-associated nucleic acid sequence. The upregulation or 
downregulation may be by increasing the rate of transcription or decreasing the rate of 
mRNA decay. 

Another embodiment of the present invention is a compound which affects 
1 5 expression of an <tf ona/-associated nucleic acid sequence. In specific embodiments said 
compound upregulates or downregulates expression of an atonaZ-associated nucleic acid 
sequence. In a specific embodiment said compound affects a detectable condition in an 
animal wherein said condition is selected from the group consisting of loss of hair cells, 
cerebellar granule neuron deficiencies, hearing impairment, an imbalance disorder, joint 
20 disease, osteoarthritis, abnormal proliferation of cells and formation of cancer. 

Another embodiment of the present invention is a method for screening for a 
compound in an animal, wherein the compound affects a detectable condition in the 
animal, comprising delivering the compound to the animal wherein at least one allele of 
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an <zfo«a/-associated nucleic acid sequence in said animal is inactivated by insertion of 
a heterologous nucleic acid sequence, wherein said heterologous nucleic acid sequence 
is under the control of an afonaZ-associated regulatory sequence, and monitoring said 
animal for a change in the detectable condition. In a specific embodiment said detectable 
5 condition is selected from the group consisting of loss of hair cells, cerebellar granule 
neuron deficiencies, hearing impairment, an imbalance disorder, joint disease, 
osteoarthritis and abnormal proliferation of cells. In another embodiment said delivery 
of said compound affects expression of said heterologous nucleic acid sequence. In 
specific embodiments said expression of said heterologous nucleic acid sequence is 

1 0 upregulated or downregulated. In additional specific embodiments the animal is a mouse, 
Drosophila, frog, zebrafish, rat, hamster and guinea pig. 

Another embodiment of the present invention is a compound wherein said 
compound affects a detectable condition in a transgenic animal of the present invention. 
In specific embodiments said compound affects expression of a heterologous nucleic acid 

15 sequence. In additional specific embodiments said compound upregulates or 
downregulates expression of a heterologous nucleic acid sequence. 

In other embodiments of the present invention are methods of treating an animal, 
including a human, for cerebellar granule neuron deficiencies, for promoting 
mechanoreceptive cell growth, for generating hair cells, for treating hearing impairment 

20 or imbalance, for treating a joint disease, for treating for an abnormal proliferation of 
cells, and for treating for a disease that is a result of loss of functional atona/-associated 
nucleic acid or amino acid sequence. Said methods include administering a 
therapeutically effective amount of an atonal-associated nucleic acid or amino acid 
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sequence. In specific embodiments said administration is by a vector selected from the 
group consisting of a viral vector (including bacteriophage, animal and plant viruses), a 
plasmid, cosmid or any other nucleic acid based vector, a liposome, a nucleic acid, a 
peptide, a lipid, a carbohydrate and a combination thereof of said vectors. In a specific 
5 embodiment said viral vector is an adenovirus vector, a retrovirus vector, or an adeno- 
associated vector, including a lentivirus vector, Herpes virus vector, alpha virus vector, 
etc. Thus, the vector may be viral or non- viral. In a preferred embodiment said vector 
is an adenovirus vector comprising a cytomegalovirus IE promoter sequence and a S V40 
early polyadenylation signal sequence. In another specific embodiment said cell is a 

10 human cell. In an additional specific embodiment said joint disease is osteoarthritis. In 
an additional specific embodiment the cell contains an alteration in an a/o/za/-associated 
amino acid sequence, wherein said amino acid sequence has at least about 80% identity 
to about 20 contiguous amino acid residues of SEQ ID NO:58. 

In a specific embodiment, the present invention also provides a method of treating 

15 an animal in need of treatment for a deficiency in cerebellar granule neurons, a hearing 
impairment, an imbalance disorder, a joint disease, or in need of promoting 
mechanoreceptive cell growth, or a disease that is a result of loss of functional atonal- 
associated nucleic acid or amino acid sequences. This method comprises delivering a 
transcription factor having an amino acid with at least about 70% identity, preferably at 

20 least about 8 0% identity, and more preferably at least about 90% identity to the sequence 
AANARERRRMHGLNHAFDQLR to a cell in the animal. In some embodiments, the 
cell in the animal is located in the inner ear of the animal. Preferably, the transcription 
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factor competes with atonal for binding to Daughterless protein (Jarman et al., 1993) or 
competes for binding with Math-1 to E47 protein (Akazawa et al., 1995). 

In an embodiment of the present invention there is provided a method for treating 
an organism for a disease that is a result of loss of functional <ztona/-associated nucleic 

5 acid or amino acid sequence. A skilled artisan is aware that this loss may be due to 
natural reduction or absence of significant (or to detectable levels) expression which 
occurs in an adult human. 

In another embodiment of the present invention is a method for treating an animal 
for an abnormal proliferation of cells comprising altering atonal-associated nucleic acid 

10 or amino acid sequence levels in a cell. In a specific embodiment said alteration is 
reduction or said nucleic acid or amino acid sequence contains an alteration. 

In a preferred embodiment of the present invention there are compositions to treat 
an organism for various medical conditions, discussed herein, comprising an atonal- 
associated nucleic acid sequence or amino acid sequence in combination with a delivery 

15 vehicle, wherein said organism comprises a defect in an <2fona/-associated nucleic acid 
sequence. A skilled artisan is aware that an adult organism, such as an adult human, 
naturally does not express atonal to significant or detectable levels, but instead expresses 
atonal in an embryonic stage of development (see the Examples). Thus, in a preferred 
embodiment, compositions to treat an organism as discussed herein, include 

20 compositions to treat organisms who do not contain a mutation in an atonal nucleic acid 
or amino acid sequence but who naturally have atonal no longer expressed to significant 
or detectable levels. 
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In another embodiment of the present invention is a composition comprising an 
afonaZ-associated amino acid sequence or nucleic acid sequence in combination with a 
delivery vehicle wherein said vehicle delivers a therapeutically effective amount of an 
atonal-associated nucleic acid sequence or amino acid sequence into a cell. In specific 
5 embodiments said vehicle is the receptor-binding domain of a bacterial toxin or any 
fusion molecule or is a protein transduction domain. In a specific embodiment said 
protein transduction domain is from the HIV TAT peptide. 

In another embodiment of the present invention there is a composition to treat an 
organism for loss of hair cells, wherein said organism comprises a defect in an atonal- 

10 associated nucleic acid sequence. In a specific embodiment the defect is a mutation or 
alteration of said atonaZ-associated nucleic acid sequence. In another specific 
embodiment the defect affects a regulatory sequence of said afowaZ-associated nucleic 
acid sequence. In an additional embodiment of the present invention there is a 
composition to treat an organism for loss of hair cells, wherein said organism comprises 

15 defect in a nucleic acid sequence which is associated with regulation of an atonal- 
associated nucleic acid sequence. In an additional embodiment of the present invention 
there is a composition to treat an organism for loss of hair cells, wherein said organism 
comprises a defect in an amino acid sequence which is associated with regulation of an 
atonal-associated nucleic acid sequence. 

20 In another embodiment of the present invention there is a composition to treat an 

organism for a cerebellar neuron deficiency, wherein said organism comprises a defect 
in an atonaUassociaXed nucleic acid sequence. In a specific embodiment the defect is a 
mutation or alteration of said atonal-associated nucleic acid sequence. In another specific 
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embodiment the defect affects a regulatory sequence of said atonal-associated nucleic 
acid sequence. In an additional embodiment of the present invention there is a 
composition to treat an organism for a cerebellar neuron deficiency, wherein said 
organism comprises defect in a nucleic acid sequence which is associated with regulation 

5 of an atowa/-associated nucleic acid sequence. In an additional embodiment of the 
present invention there is a composition to treat an organism for a cerebellar neuron 
deficiency, wherein said organism comprises a defect in an amino acid sequence which 
is associated with regulation of an #/£?«a/-associated nucleic acid sequence. 

In another embodiment of the present invention there is a composition to treat an 

1 0 organism for hearing impairment, wherein said organism comprises a defect in an atonal- 
associated nucleic acid sequence. In a specific embodiment the defect is a mutation or 
alteration of said atonal-associated nucleic acid sequence. In another specific 
embodiment the defect affects a regulatory sequence of said afonaZ-associated nucleic 
acid sequence. In an additional embodiment of the present invention there is a 

15 composition to treat an organism for hearing impairment, wherein said organism 
comprises defect in a nucleic acid sequence which is associated with regulation of an 
itfonaZ-associated nucleic acid sequence. In an additional embodiment of the present 
invention there is a composition to treat an organism for hearing impairment, wherein 
said organism comprises a defect in an amino acid sequence which is associated with 

20 regulation of an atonal-associated nucleic acid sequence. 

In another embodiment of the present invention there is a composition to treat an 
organism for an imbalance disorder, wherein said organism comprises a defect in an 
afowaZ-associated nucleic acid sequence. In a specific embodiment the defect is a 
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mutation or alteration of said atonal-associated nucleic acid sequence. In another specific 
embodiment the defect affects a regulatory sequence of said atonal-associated nucleic 
acid sequence. In an additional embodiment of the present invention there is a 
composition to treat an organism for an imbalance disorder, wherein said organism 

5 comprises defect in a nucleic acid sequence which is associated with regulation of an 
atonal-associated nucleic acid sequence. In an additional embodiment of the present 
invention there is a composition to treat an organism for imbalance, wherein said 
organism comprises a defect in an amino acid sequence which is associated with 
regulation of an atonal-associated nucleic acid sequence. 

10 In another embodiment of the present invention there is a composition to treat an 

organism for osteoarthritis, wherein said organism comprises a defect in an atonal- 
associated nucleic acid sequence. In a specific embodiment the defect is a mutation or 
alteration of said afonaZ-associated nucleic acid sequence. In another specific 
embodiment the defect affects a regulatory sequence of said atonaZ-associated nucleic 

15 acid sequence. In an additional embodiment of the present invention there is a 
composition to treat an organism for osteoarthritis, wherein said organism comprises 
defect in a nucleic acid sequence which is associated with regulation of an atonal- 
associated nucleic acid sequence. In an additional embodiment of the present invention 
there is a composition to treat an organism for osteoarthritis, wherein said organism 

20 comprises a defect in an amino acid sequence which is associated with regulation of an 
afowaZ-associated nucleic acid sequence. 

In another embodiment of the present invention there is a composition to treat an 
organism for a joint disease, wherein said organism comprises a defect in an atonal- 
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associated nucleic acid sequence. In a specific embodiment the defect is a mutation or 
alteration of said tftewaZ-associated nucleic acid sequence. In another specific 
embodiment the defect affects a regulatory sequence of said atonal-associated nucleic 
acid sequence. In an additional embodiment of the present invention there is a 
5 composition to treat an organism for a joint disease, wherein said organism comprises 
defect in a nucleic acid sequence which is associated with regulation of an atonal- 
associated nucleic acid sequence. In an additional embodiment of the present invention 
there is a composition to treat an organism for a joint disease, wherein said organism 
comprises a defect in an amino acid sequence which is associated with regulation of an 
1 0 atonaZ-associated nucleic acid sequence. 

In another embodiment of the present invention there is a composition to treat an 
organism for abnormal proliferation of cells, wherein said organism comprises a defect 
in an afowaZ-associated nucleic acid sequence. In a specific embodiment the defect is a 
mutation or alteration of said afo«a/-associated nucleic acid sequence. In another specific 
15 embodiment the defect affects a regulatory sequence of said atorcaZ-associated nucleic 
acid sequence. In an additional embodiment of the present invention there is a 
composition to treat an organism for abnormal proliferation of cells, wherein said 
organism comprises defect in a nucleic acid sequence which is associated with regulation 
of an a&wwjZ-associated nucleic acid sequence. In an additional embodiment of the 
20 present invention there is a composition to treat an organism for abnormal proliferation 
of cells, wherein said organism comprises a defect in an amino acid sequence which is 
associated with regulation of an atowaZ-associated nucleic acid sequence. 
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In another embodiment of the present invention there is a composition to treat an 
organism for cancer, wherein said organism comprises a defect in an aro«a/-associated 
nucleic acid sequence. In a specific embodiment the defect is a mutation or alteration of 
said atonal-associated nucleic acid sequence. In another specific embodiment the defect 

5 affects a regulatory sequence of said atonal-associated nucleic acid sequence. In an 
additional embodiment of the present invention there is a composition to treat an 
organism for cancer, wherein said organism comprises defect in a nucleic acid sequence 
which is associated with regulation of an atonal-associated nucleic acid sequence. In an 
additional embodiment of the present invention there is a composition to treat an 

10 organism for cancer, wherein said organism comprises a defect in an amino acid 
sequence which is associated with regulation of an atonal-associated nucleic acid 
sequence. In a specific embodiment said cancer is medulloblastoma. 

NUCLEIC ACID-BASED EXPRESSION SYSTEMS 

15 

1. Vectors 

One of skill in the art would be well equipped to construct a vector through 
standard recombinant techniques, which are described in Sambrook et al., 1989 and 
Ausubel et aL, 1994, both incorporated herein by reference. 

20 

The term "expression vector" refers to a vector containing a nucleic acid sequence coding 
for at least part of a gene product capable of being transcribed. In some cases, RNA 
molecules are then translated into a protein, polypeptide, or peptide. In other cases, these 
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sequences are not translated, for example, in the production of antisense molecules or 
ribozymes. Expression vectors can contain a variety of "control sequences," which refer 
to nucleic acid sequences necessary for the transcription and possibly translation of an 
operably linked coding sequence in a particular host organism. In addition to control 
5 sequences that govern transcription and translation, vectors and expression vectors may 
contain nucleic acid sequences that serve other functions as well and are described infra. 



a. Promoters and Enhancers 

10 A "promoter" is a control sequence that is a region of a nucleic acid sequence at 

which initiation and rate of transcription are controlled. It may contain genetic elements 
at which regulatory proteins and molecules may bind such as RNA polymerase and other 
transcription factors, A promoter may or may not be used in conjunction with an 
"enhancer," which refers to a c/s-acting regulatory sequence involved in the 

15 transcriptional activation of a nucleic acid sequence. 

Naturally, it will be important to employ a promoter and/or enhancer that 
effectively directs the expression of the DNA segment in the cell type, organelle, and 
organism chosen for expression. Those of skill in the art of molecular biology generally 
know the use of promoters, enhancers, and cell type combinations for protein expression, 

20 for example, see Sambrook et al. (1989), incorporated herein by reference. The 
promoters employed canbe constitutive, tissue-specific, inducible, and/or useful under 
the appropriate conditions to direct high level expression of the introduced DNA 
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segment, such as is advantageous in the large-scale production of recombinant proteins 
and/or peptides. The promoter canbe heterologous or endogenous. 

The identity of tissue-specific promoters or elements, as well as assays to 
characterize their activity, is well known to those of skill in the art. Examples of such 

5 regions include the human LIMK2 gene (Nomoto et al. 1 999), the somatostatin receptor 
2 gene (Kraus et al., 1998), murine epididymal retinoic acid-binding gene (Lareyre et al., 
1999), human CD4 (Zhao-Emonet et al., 1998), mouse alpha2 (XI) collagen (Tsumaki, 
et aL, 1998), Dl A dopamine receptor gene (Lee, et al., 1997), insulin-like growth factor 
II (Wu et al., 1997), human platelet endothelial cell adhesion molecule- 1 (Almendro et 

10 al., 1996). 

b. Initiation Signals and Internal Ribosome Binding Sites 

A specific initiation signal also canbe required for efficient translation of coding 
sequences. These signals include the ATG initiation codon or adjacent sequences. 

1 5 Exogenous translational control signals, including the ATG initiation codon, canneed to 
be provided. One of ordinary skill in the art would readily be capable of determining this 
and providing the necessary signals. It is well known that the initiation codon must be 
H in-frame" with the reading frame of the desired coding sequence to ensure translation 
of the entire insert. The exogenous translational control signals and initiation codons can 

20 be either natural or synthetic. The efficiency of expression canbe enhanced by the 
inclusion of appropriate transcription enhancer elements. 

In certain embodiments of the invention, the use of internal ribosome entry sites 
(IRES) elements are used to create multigene, or polycistronic, messages. IRES elements 
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can be linked to heterologous open reading frames. Multiple open reading frames can 
be transcribed together, each separated by an IRES, creating polycistronic messages. By 
virtue of the IRES element, each open reading frame is accessible to ribosomes for 
efficient translation. Multiple genes can be efficiently expressed using a single 
5 promoter/enhancer to transcribe a single message (see U.S. Patent 5,925,565 and 
5,935,8 19, herein incorporated by reference). 

c. Multiple Cloning Sites 

Vectors can include a multiple cloning site (MCS), which is a nucleic acid region 
1 0 that contains multiple restriction enzyme sites, any of which can be used in conjunction 
with standard recombinant technology to digest the vector. (See Carbonelh et al., 1 999, 
Levenson et al., 1998, and Cocea, 1997, incorporated herein by reference.) 

d. Splicing Sites 

15 Most transcribed eukaryotic RNA molecules will undergo RNA splicing to 

remove introns from the primary transcripts. Vectors containing genomic eukaryotic 
sequences canrequire donor and/or acceptor splicing sites to ensure proper processing of 
the transcript for protein expression. (See Chandler et al., 1 997, herein incorporated by 
reference.) 

20 

e. Polyadenylation Signals 

In expression, one will typically include a polyadenylation signal to effect proper 
polyadenylation of the transcript. Specific embodiments include the SV40 
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polyadenylation signal and/or the bovine growth hormone polyadenylation signal, 
convenient and/or known to function well in various target cells. 



f. Origins of Replication 

5 In order to propagate a vector in a host cell, it cancontain one or more origins of 

replication sites (often termed "ori"), which is a specific nucleic acid sequence at which 
replication is initiated. 

g. Selectable and Screenable Markers 

10 In certain embodiments of the invention, the cells contain nucleic acid construct 

of the present invention, a cell canbe identified in vitro or in vivo by including a marker 
in the expression vector. Such markers would confer an identifiable change to the cell 
permitting easy identification of cells containing the expression vector. Generally, a 
selectable marker is one that confers a property that allows for selection. A positive 

15 selectable marker is one in which the presence of the marker allows for its selection, 
while a negative selectable marker is one in which its presence prevents its selection. An 
example of a positive selectable marker is a drug resistance marker. 

Usually the inclusion of a drug selection marker aids in the cloning and 
20 identification of transformants, for example, genes that confer resistance to neomycin, 
puromycin, hygromycin, DHFR, GPT, zeocin and histidinol are useful selectable 
markers. In addition to markers conferring aphenotype that allows for the discrimination 
of transformants based on the implementation of conditions, other types of markers 
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including screenable markers such as GFP or enhanced GFP, whose basis is colorimetric 
analysis, are also contemplated. Alternatively, screenable enzymes such as herpes 
simplex virus thymidine kinase (tk) or chloramphenicol acetyltransferase (CAT) canbe 
utilized. One of skill in the art would also know how to employ immunologic markers, 
5 possibly in conjunction with FACS analysis. Further examples of selectable and 
screenable markers are well known to one of skill in the art. 

2. Expression Systems 

Numerous expression systems exist that comprise at least a part or all of the 
10 compositions discussed above. Prokaryote- and/or eukaryote-based systems can be 
employed for use with the present invention to produce nucleic acid sequences, or their 
cognate polypeptides, proteins and peptides. Many such systems are commercially and 
widely available. 

The insect cell/baculovirus system can produce a high level of protein expression 
15 of a heterologous nucleic acid segment, such as described in U.S. Patent No. 5,871,986, 
4,879,236, both herein incorporated by reference, and which can be bought, for example, 
under the name MAXBAC® 2.0 from INVITROGEN® and BACPACK™ 
B ACULOVIRUS EXPRESSION SYSTEM FROM CLONTECH® . Other examples of 
expression systems are well known in the art. 

20 

Nucleic Acid Detection 

In addition to their use in directing the expression of atonal-associated proteins, 
polypeptides and/or peptides, the nucleic acid sequences disclosed herein have a variety 
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of other uses. For example, they have utility as probes or primers or in any of the 
methods for embodiments involving nucleic acid hybridization, amplification of nucleic 
acid sequences, detection of nucleic acids, and other assays. A skilled artisan is aware 
of the following patents regarding details of these methods: U.S. Patent No. 5,840,873; 
5 U.S. Patent No. 5,843, 640; U.S. Patent No. 5,843,650; U.S. Patent No. 5,843,651; U.S. 
PatentNo. 5,843,663; U.S. PatentNo. 5,846,708; U.S. Patent No. 5,846,709; U.S. Patent 
No. 5,846,717; U.S. Patent No.5,846,726; U.S. Patent No. 5,846,729; U.S. Patent No. 
5,846,783; U.S. Patent No. 5,849,481; U.S. Patent No. 5,849,483; U.S. Patent No. 
5,849,486; U.S. Patent No. 5,849,487; U.S. Patent No. 5,849,497; U.S. Patent No. 
10 5,849,546; U.S. Patent No. 5,849,547; U.S. Patent No. 5,851,770; U.S. Patent No. 
5,851,772; U.S. Patent No.5,853,990; U.S. Patent No. 5,853, 993; U.S. Patent No. 

5,853,992; U.S. Patent No. 5,856,092; U.S. Patent No. 5,858,652; U.S. Patent No. 

5,861,244; U.S. Patent No.5,863,732; U.S. Patent No. 5,863,753; U.S. Patent No. 

5,866,331; U.S. Patent No. 5,866,336; U.S. Patent No. 5,866,337; U.S. Patent No. 
15 5,900,481; U.S. Patent No. 5,905,024; U.S. Patent No. 5,910,407; U.S. Patent No. 

5,912,124; U.S. Patent No. 5,912,145; U.S. Patent No. 5,912,148; U.S. Patent No. 

5,916,776; U.S. Patent No. 5,916,779; U.S. Patent No. 5,919,626; U.S. Patent No. 

5,919,630; U.S. Patent No. 5,922, 574; U.S. Patent No. 5,925,517; U.S. Patent No. 

5,925,525; U.S. Patent No. 5,928,862; U.S. Patent No. 5,928,869; U.S. Patent No. 
20 5,928,870; U.S. Patent No. 5,928,905; U.S. Patent No. 5,928,906; U.S. Patent No. 

5,929,227; U.S. Patent No. 5,932,413; U.S. Patent No. 5,932,451; U.S. Patent No. 

5,935,791; U.S. Patent No. 5,935,825; U.S. Patent No. 5,939,291; U.S. Patent No. 

5,942,391; European Application No. 320 308; European Application No. 329 822; GB 
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Application No. 2 202 328; PCT Application No. PCT/US87/00880; PCT Application 
No. PCT/US89/01025; PCT Application WO 88/10315; PCT Application WO 
89/06700; and PCT Application WO 90/07641. 

5 Kits 

All the essential materials and/or reagents required for detecting a sequence 
selected from SEQ ID NO:l through SEQ ID NO:66 in a sample canbe assembled 
together in a kit. This generally will comprise a probe or primers designed to hybridize 
specifically to individual nucleic acids of interest in the practice of the present invention, 

10 such as the nucleic acid sequences in SEQ ID NO:l through SEQ ID NO:66. Also 
included canbe enzymes suitable for amplifying nucleic acids, including various 
polymerases (reverse transcriptase, Taq, etc.), deoxynucleotides and buffers to provide 
the necessary reaction mixture for amplification. Such kits canalso include enzymes and 
other reagents suitable for detection of specific nucleic acids or amplification products. 

15 Such kits generally will comprise, in suitable means, distinct containers for each 
individual reagent or enzyme as well as for each probe or primer pair. 



Atonal-Associated Nucleic Acids 
20 A, Nucleic Acids and Uses Thereof 

The term "nucleic acid" will generally refer to at least one molecule or strand of 
DNA, RNA or a derivative or mimic thereof, comprising at least one nucleobase, such 
as, for example, a naturally occurring purine or pyrimidine base found in DNA (e.g. 
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adenine "A," guanine "G," thymine "T" and cytosine "C") or RNA (e.g. A, G, uracil "U" 
and C). The term "nucleic acid" encompass the terms "oligonucleotide" and 
"polynucleotide." The term "oligonucleotide" refers to at least one molecule of between 
about 3 and about 1 00 nucleobases in length. The term "polynucleotide" refers to at least 
5 one molecule of greater than about 100 nucleobases in length. These definitions 
generally refer to at least one single-stranded molecule, but in specific embodiments will 
also encompass at least one additional strand that is partially, substantially or fully 
complementary to the at least one single-stranded molecule. Thus, a nucleic acid 
canencompass at least one double-stranded molecule or at least one triple-stranded 
1 0 molecule that comprises one or more complementary strand(s) or "complement(s)" of a 
particular sequence comprising a strand of the molecule. As used herein, a single 
stranded nucleic acid canbe denoted by the prefix "ss", a double stranded nucleic acid by 
the prefix "ds", and a triple stranded nucleic acid by the prefix "ts." 

Thus, the present invention also encompasses at least one nucleic acid that is 
15 complementary to a atonal-associated nucleic acid. In particular embodiments the 
invention encompasses at least one nucleic acid or nucleic acid segment complementary 
to the nucleic acid sequences set forth in SEQ ID NO: 1 through SEQ ID NO:66, of those 
which are nucleic acid sequences. Nucleic acid(s) that are "complementary" or 
"complement^)" are those that are capable of base-pairing according to the standard 
20 Watson-Crick, Hoogsteen or reverse Hoogsteen binding complementarity rules. As used 
herein, the term "complementary" or "complement(s)" also refers to nucleic acid(s) that 
are substantially complementary, as canbe assessed by the same nucleotide comparison 
set forth above. The term "substantially complementary" refers to a nucleic acid 
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comprising at least one sequence of consecutive nucleobases, or semiconsecutive 
nucleobases if one or more nucleobase moieties are not present in the molecule, are 
capable of hybridizing to at least one nucleic acid strand or duplex even if less than all 
nucleobases do not base pair with a counterpart nucleobase. 
5 Herein certain embodiments, a "gene" refers to a nucleic acid that is transcribed. 

As used herein, a "gene segment" is a nucleic acid segment of a gene. In certain aspects, 
the gene includes regulatory sequences involved in transcription, or message production 
or composition. In particular embodiments, the gene comprises transcribed sequences 
that encode for a protein, polypeptide or peptide. In other particular aspects, the gene 
10 comprises an atonal-associated nucleic acid, and/or encodes an atonal-associated 
polypeptide or peptide coding sequences. In keeping with the terminology described 
herein, an "isolated gene" cancomprise transcribed nucleic acid(s), regulatory sequences, 
coding sequences, or the like, isolated substantially away from other such sequences, 
such as other naturally occurring genes, regulatory sequences, polypeptide or peptide 
15 encoding sequences, etc. In this respect, the terai "gene" is used for simplicity to refer 
to a nucleic acid comprising a nucleotide sequence that is transcribed, and the 
complement thereof. In particular aspects, the transcribed nucleotide sequence comprises 
at least one functional protein, polypeptide and/or peptide encoding unit. As will be 
understood by those in the art, this function term "gene" includes both genomic 
20 sequences, RNA or cDNA sequences or smaller engineered nucleic acid segments, 
including nucleic acid segments of a non-transcribed part of a gene, including but not 
limited to the non-transcribed promoter or enhancer regions of a gene. Smaller 
engineered gene nucleic acid segments canexpress, or canbe adapted to express using 
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nucleic acid manipulation technology, proteins, polypeptides, domains, peptides, fusion 
proteins, mutants and/or such like. 

In certain embodiments, the nucleic acid sequence is a nucleic acid or nucleic 
acid segment. As used herein, the term "nucleic acid segment", are smaller fragments of 
5 a nucleic acid, such as for non-limiting example, those that encode only part of the 
atonal-associated peptide or polypeptide sequence. Thus, a "nucleic acid segment" 
cancomprise any part of the atonal-associated gene sequence(s), of from about 2 
nucleotides to the full length of the atonal-associated peptide or polypeptide encoding 
region. In certain embodiments, the "nucleic acid segment" encompasses the full length 
10 atonal-associated gene(s) sequence. In particular embodiments, the nucleic acid 
comprises any part of the SEQ ID NO:l through SEQ ID NO:66, of from about 2 
nucleotides to the full length of the sequence disclosed in SEQ ID NO: 1 through SEQ ID 
NO:66. 

In certain embodiments, the nucleic acid segment canbe a probe or primer. As 
1 5 used herein, a "probe" is a nucleic acid utilized for detection of another nucleic acid and 
is generally at least about 10 nucleotides in length. As used herein, a "primer" is a 
nucleic acid utilized for polymerization of another nucleic acid is generally at least about 
10 nucleotides in length. A non-limiting example of this would be the creation of nucleic 
acid segments of various lengths and sequence composition for probes and primers based 
20 on the sequences disclosed in SEQ ID NO: 1 through SEQ ID NO:66, of those which are 
nucleic acid sequences. 

The nucleic acid(s) of the present invention, regardless of the length of the 
sequence itself, canbe combined with other nucleic acid sequences, including but not 
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limited to, promoters, enhancers, polyadenylation signals, restriction enzyme sites, 
multiple cloning sites, coding segments, and the like, to create one or more nucleic acid 
constract(s). As used herein, a "nucleic acid construct" is a recombinant molecule 
comprising at least two segments of different nucleic acid sequence. The overall length 
5 canvary considerably between nucleic acid constructs. Thus, a nucleic acid segment of 
almost any length canbe employed, with the total length preferably being limited by the 
ease of preparation or use in the intended recombinant nucleic acid protocol. 

In certain embodiments, the nucleic acid construct is a recombinant vector. As 
used herein, a "recombinant vector" is a nucleic acid comprising multiple segments of 
10 nucleic acids utilized as a vehicle for a nucleic acid sequence of interest. In certain 
aspects, the recombinant vector is an expression cassette. As used herein, an expression 
cassette is a segment of nucleic acid which comprises a gene of interest which canbe 
transfered between different recombinant vectors by means well known in the art. 

In particular embodiments, the invention concerns one or more recombinant 
15 vector(s) comprising nucleic acid sequences that encode an atonal-associated protein, 
polypeptide or peptide that includes within its amino acid sequence a contiguous amino 
acid sequence in accordance with, or essentially as set forth in, SEQ ID NO:2 through 
SEQ ID NO:66, of which sequences are amino acid sequences, corresponding to Homo 
sapiens or Mus musculus atonal-associated sequence.. In other embodiments, the 
20 invention concerns recombinant vector(s) comprising nucleic acid sequences from other 
species that encode an atonal-associated protein, polypeptide or peptide that includes 
within its amino acid sequence a contiguous amino acid sequence in accordance with, or 
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essentially as set forth in SEQ ID NO:2 through SEQ ID NO:66, of which sequences are 
amino acid sequences. In particular aspects, the recombinant vectors are DNA vectors. 

It will also be understood that amino acid sequences or nucleic acid sequences 
caninclude additional residues, such as additional N- or C-terminal amino acids or 5' or 

5 3* sequences, or various combinations thereof, and yet still be essentially as set forth in 
one of the sequences disclosed herein, so long as the sequence meets the criteria set forth 
above, including the maintenance of biological protein, polypeptide or peptide activity 
where expression of aproteinaceous composition is concerned. The addition of terminal 
sequences particularly applies to nucleic acid sequences that may, for example, include 

10 various non-coding sequences flanking either of the 5 ! and/or 3 f portions of the coding 
region or can include various internal sequences, i.e., introns, which are known to occur 
within genes. 

It will also be understood that this invention is not limited to the particular nucleic 
acid or amino acid sequences of SEQ ID NO: through SEQ ID NO:66, of which 

1 5 sequences are amino acids. Recombinant vectors and isolated nucleic acid segments can 
therefore variously include these coding regions themselves, coding regions bearing 
selected alterations or modifications in the basic coding region, and they canencode larger 
polypeptides or peptides that nevertheless include such coding regions or canencode 
biologically functional equivalent proteins, polypeptide or peptides that have variant 

20 amino acids sequences. 

The nucleic acids of the present invention encompass biologically fiinctional 
equivalent atonal-associated proteins, polypeptides, or peptides or atonal-associated 
proteins, polypeptides or polypeptides. Such sequences canarise as a consequence of 
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codon redundancy or functional equivalency that are known to occur naturally within 
nucleic acid sequences or the proteins, polypeptides or peptides thus encoded. 
Alternatively, functionally equivalent proteins, polypeptides or peptides can be created 
via the application of recombinant DNA technology, in which changes in the protein, 
5 polypeptide or peptide structure can be engineered, based on considerations of the 
properties of the amino acids being exchanged. Changes designed by man can be 
introduced, for example, through the application of site-directed mutagenesis techniques 
as discussed herein below, e.g., to introduce improvements or alterations to the 
antigenicity of the protein, polypeptide or peptide, or to test mutants in order to examine 

10 atonal-associated protein, polypeptide or peptide activity at the molecular level. 

Fusion proteins, polypeptides or peptides can be prepared, e.g., where the atonal 
associated coding regions are aligned within the same expression unit with other proteins, 
polypeptides or peptides having desired functions. Non-limiting examples of such 
desired functions of expression sequences include purification or immunodetection 

1 5 purposes for the added expression sequences, e.g., proteinaceous compositions that canbe 
purified by affinity chromatography or the enzyme labeling of coding regions, 
respectively EP 266,032, or via deoxynucleoside H-phosphonate intermediates as 
described by Froehler et al, Nucl. Acids Res., 14:5399-5407, 1986, 

As used herein an "organism" can be a prokaryote, eukaryote, virus and the like. 

20 As used herein the term "sequence" encompasses both the terms "nucleic acid" and 
"proteancecous" or "proteanaceous composition." As used herein, the term 
"proteinaceous composition" encompasses the terms "protein", "polypeptide" and 
"peptide." As used herein "artificial sequence" refers to a sequence of a nucleic acid not 
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derived from sequence naturally occurring at a genetic locus, as well as the sequence of 
any proteins, polypeptides or peptides encoded by such a nucleic acid. A "synthetic 
sequence", refers to a nucleic acid or proteinaceous composition produced by chemical 
synthesis in vitro, rather than enzymatic production in vitro (i.e. an "enzymatically 
5 produced" sequence) or biological production in vivo (i.e. a "biologically produced" 
sequence). 



Cancer Therapies 

10 Given the present invention is directed to methods and compositions for the 

treatment of abnormal cell proliferation, a discussion of therapies of cancer, which is the 
state of abnormal cell proliferation, is warranted. 

A wide variety of cancer therapies, such as radiotherapy, surgery, chemotherapy 
and gene therapy, are known to one of skill in the art, canbe used regarding the methods 

1 5 and compositions of the present invention. 

Radiotherapeutic agents 

Radiotherapeutic agents and factors include radiation and waves that induce DNA 
damage for example, g-irradiation, X-rays, UV-irradiation, microwaves, electronic 
20 emissions, radioisotopes, and the like. Therapy can be achieved by irradiating the 
localized tumor site with the above described forms of radiations. 

Dosage ranges for X-rays range from daily doses of 50 to 200 roentgens for 
prolonged periods of time (3 to 4 weeks), to single doses of 2000 to 6000 roentgens. 
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Dosage ranges for radioisotopes vary widely, and depend on the half-life of the isotope, 
the strength and type of radiation emitted, and the uptake by the neoplastic cells. 



Surgery 

5 Surgical treatment for removal of the cancerous growth is generally a standard 

procedure for the treatment of tumors and cancers. This attempts to remove the entire 
cancerous growth. However, surgery is generally combined with chemotherapy and/or 
radiotherapy to ensure the destruction of any remaining neoplastic or malignant cells. 
Thus, surgery can be used in the context of the present invention. 

10 

Chemotherapeutic Agents 

These can be, for example, agents that directly cross-link DNA, agents that 
intercalate into DNA, or agents that lead to chromosomal and mitotic aberrations by 
affecting nucleic acid synthesis. 

1 5 Agents that directly cross-link nucleic acids, specifically DNA, are envisaged and 

are shown herein, to eventuate DNA damage leading to a synergistic antineoplastic 
combination. Agents such as cisplatin, and other DNA alkylating agents can be used. 

Agents that damage DNA also include compounds that interfere with DNA 
replication, mitosis, and chromosomal segregation. Examples of these compounds 

20 include adriamycin (also known as doxorubicin), VP-16 (also known as etoposide), 
verapamil, podophyllotoxin, and the like. Widely used in clinical setting for the 
treatment of neoplasms these compounds are administered through bolus injections 
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intravenously at doses ranging from 25-75 mg/m2 at 21 day intervals for adriamycin, to 
35-100 mg/m2 for etoposide intravenously or orally. 

Cancer therapies also include a variety of combination therapies with both 
chemical and other types of treatments. Chemotherapeutics include, for example, 

5 cisplatin (CDDP), carboplatin, procarbazine, mechlorethamine, cyclophosphamide, 
camptothecin, ifosfamide, melphalan, chlorambucil, busulfan, nitrosurea, dactinomycin, 
daunorubicin, doxorubicin, bleomycin, plicomycin, mitomycin, etoposide (VP 16), 
tamoxifen, raloxifene, estrogen receptor binding agents, taxol, gemcitabien, navelbine, 
famesyl-proteintansferase inhibitors, transplatinum, 5-fluorouracil, vincristin, vinblastin 

10 and methotrexate, or any analog or derivative variant of the foregoing. 

Genes 

Gene Therapy Administration 

For gene therapy, a skilled artisan would be cognizant that the vector to be 
1 5 utilized must contain the gene of interest or a suitable fragment thereof operatively linked 
to a promoter. For antisense gene therapy, the antisense sequence of the gene of interest 
or a suitable fragment thereof would be operatively linked to a promoter. One skilled in 
the art recognizes that in certain instances other sequences such as a 3' UTR regulatory 
sequences are useful in expressing the gene of interest. Where appropriate, the gene 
20 therapy vectors can be formulated into preparations in solid, semisolid, liquid or gaseous 
forms in the ways known in the art for their respective route of administration. Means 
known in the art can be utilized to prevent release and absorption of the composition until 
it reaches the target organ or to ensure timed-release of the composition. A 
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pharmaceutical^ acceptable form should be employed which does not ineffectuate the 
compositions of the present invention. In pharmaceutical dosage forms, the compositions 
can be used alone or in appropriate association, as well as in combination, with other 
pharmaceutically active compounds. A sufficient amount of vector containing the 
5 therapeutic nucleic acid sequence must be administered to provide a pharmacologically 
effective dose of the gene product. 

One skilled in the art recognizes that different methods of delivery can be utilized 
to administer a vector into a cell. Examples include: (1) methods utilizing physical 
means, such as electroporation (electricity), a gene gun (physical force) or applying large 
10 volumes of a liquid (pressure); and (2) methods wherein said vector is complexed to 
another entity, such as a liposome, viral vector or transporter molecule. 

Accordingly, the present invention provides a method of transferring a therapeutic 
gene to a host, which comprises administering the vector of the present invention, 
preferably as part of a composition, using any of the aforementioned routes of 
1 5 administration or alternative routes known to those skilled in the art and appropriate for 
a particular application. Effective gene transfer of a vector to a host cell in accordance 
with the present invention to a host cell can be monitored in terms of a therapeutic effect 
(e.g. alleviation of some symptom associated with the particular disease being treated) 
or, further, by evidence of the transferred gene or expression of the gene within the host 
20 (e.g., using the polymerase chain reaction in conjunction with sequencing, Northern or 
Southern hybridizations, or transcription assays to detect the nucleic acid in host cells, 
or using immunoblot analysis, antibody-mediated detection, mRNA or protein half-life 
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studies, or particularized assays to detect protein or polypeptide encoded by the 
transferred nucleic acid, or impacted in level or function due to such transfer). 

These methods described herein are by no means all-inclusive, and further 
methods to suit the specific application will be apparent to the ordinary skilled artisan. 
5 Moreover, the effective amount of the compositions can be further approximated through 
analogy to compounds known to exert the desired effect. 

Furthermore, the actual dose and schedule can vary depending on whether the 
compositions are administered in combination with other pharmaceutical compositions, 
or depending on interindividual differences in pharmacokinetics, drug disposition, and 
10 metabolism. Similarly, amounts can vary in in vitro applications depending on the 
particular cell line utilized (e.g., based on the number of vector receptors present on the 
cell surface, or the ability of the particular vector employed for gene transfer to replicate 
in that cell line). Furthermore, the amount of vector to be added per cell will likely vary 
with the length and stability of the therapeutic gene inserted in the vector, as well as also 
15 the nature of the sequence, and is particularly a parameter which needs to be determined 
empirically, and can be altered due to factors not inherent to the methods of the present 
invention (for instance, the cost associated with synthesis). One skilled in the art can 
easily make any necessary adjustments in accordance with the exigencies of the particular 
situation. 

20 It is possible that cells containing the therapeutic gene canalso contain a suicide 

gene (i.e., a gene which encodes a product that can be used to destroy the cell, such as 
herpes simplex virus thymidine kinase). In many gene therapy situations, it is desirable 
to be able to express a gene for therapeutic purposes in a host cell but also to have the 
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capacity to destroy the host cell once the therapy is completed, becomes uncontrollable, 
or does not lead to a predictable or desirable result. Thus, expression of the therapeutic 
gene in a host cell can be driven by a promoter although the product of said suicide gene 
remains harmless in the absence of a prodrug. Once the therapy is complete or no longer 

5 desired or needed, administration of a prodrug causes the suicide gene product to become 
lethal to the cell. Examples of suicide gene/prodrug combinations which can be used are 
Herpes Simplex Virus-thymidine kinase (HSV-tk) and ganciclovir, acyclovir or FIAU; 
oxidoreductase and cycloheximide; cytosine deaminase and 5-fluorocytosine; thymidine 
kinase thymidilate kinase (Tdk::Tmk) and AZT; and deoxycytidine kinase and cytosine 

10 arabinoside. 

The method of cell therapy can be employed by methods known in the art wherein 
a cultured cell containing a copy of a nucleic acid sequence or amino acid sequence of 
Mathl is introduced. 

In yet another embodiment, the secondary treatment is a secondary gene therapy 
15 in which a second therapeutic polynucleotide is administered before, after, or at the same 
time a first therapeutic polynucleotide encoding all of part of an atonal-associated 
polypeptide. Delivery of a vector encoding either a full length or partial atonal- 
associated polypeptide in conjuction with a second vector encoding another gene product 
will have a combined anti-hyperproliferative effect on target tissues. Alternatively, a 
20 single vector encoding both genes can be used. 

Immunotherapy 
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Immunotherapies, generally, rely on the use of immune effector cells and 
molecules to target and destroy cancer cells. The immune effector can be, for example, 
an antibody specific for some marker on the surface of a tumor cell. The antibody alone 
canserve as an effector of therapy or it canrecruit other cells to actually effect cell killing. 

5 The antibody also can be conjugated to a drug or toxin (chemotherapeutic, radionuclide, 
ricin A chain, cholera toxin, pertussis toxin, etc.) and serve merely as a targeting agent. 
Alternatively, the effector can be a lymphocyte carrying a surface molecule that interacts, 
either directly or indirectly, with a tumor cell target. Various effector cells include 
cytotoxic T cells and NK cells. 

10 Immunotherapy, thus, could be used as part of a combined therapy, in conjunction 

with Ad-mda7 gene therapy. The general approach for combined therapy is discussed 
below. Generally, the tumor cell must bear some marker that is amenable to targeting, 
i.e., is not present on the majority of other cells. Many tumor markers exist and any of 
these can be suitable for targeting in the context of the present invention. Common 

1 5 tumor markers include carcinoembryonic antigen, prostate specific antigen, urinary tumor 
associated antigen, fetal antigen, tyrosinase (p97), gp68, TAG-72, HMFG, Sialyl Lewis 
Antigen, MucA, MucB, PLAP, estrogen receptor, laminin receptor, erb B and p!55. 

Combination Treatments 

20 It can be desirable in utilizing the present invention to combine the compositions 

with other agents effective in the treatment of hyperproliferative disease, such as 
anti-cancer agents. An "anti-cancer" agent is capable of negatively affecting cancer in 
a subject, for example, by killing cancer cells, inducing apoptosis in cancer cells, 
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reducing the growth rate of cancer cells, reducing the incidence or number of metastases, 
reducing tumor size, inhibiting tumor growth, reducing the blood supply to a tumor or 
cancer cells, promoting an immune response against cancer cells or a tumor, preventing 
or inhibiting the progression of cancer, or increasing the lifespan of a subj ect with cancer. 

5 More generally, these other compositions would be provided in a combined amount 
effective to kill or inhibit proliferation of the cell. This process caninvolve contacting the 
cells with the expression construct and the agent(s) or multiple factor(s) at the same time. 
This can be achieved by contacting the cell with a single composition or pharmacological 
formulation that includes both agents, or by contacting the cell with two distinct 

10 compositions or formulations, at the same time, wherein one composition includes the 
expression construct and the other includes the second agent(s). 

Tumor cell resistance to chemotherapy and radiotherapy agents represents a major 
problem in clinical oncology. One goal of current cancer research is to find ways to 

1 5 improve the efficacy of chemo- and radiotherapy by combining it with gene therapy. For 
example, the herpes simplex-thymidine kinase (HS-tK) gene, when delivered to brain 
tumors by a retroviral vector system, successfully induced susceptibility to the antiviral 
agent ganciclovir (Culver, et al., 1992). In the context of the present invention, it is 
contemplated that mda-7 gene therapy could be used similarly in conjunction with 

20 chemotherapeutic, radiotherapeutic, or immunotherapeutic intervention, in addition to 
other pro-apoptotic or cell cycle regulating agents. 

Alternatively, the gene therapy canprecede or follow the other agent treatment by 
intervals ranging from minutes to weeks. In embodiments where the other agent and 
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expression construct are applied separately to the cell, one would generally ensure that 
a significant period of time did not expire between the time of each delivery, such that 
the agent and expression construct would still be able to exert an advantageously 
combined effect on the cell. In such instances, it is contemplated that one cancontact the 
5 cell with both modalities within about 1 2-24 h of each other and, more preferably, within 
about 6-12 h of each other. In some situations, it can be desirable to extend the time 
period for treatment significantly, however, where several d (2, 3, 4, 5, 6 or 7) to several 
wk (1, 2, 3, 4, 5, 6, 7 or 8) lapse between the respective administrations. 

10 Various combinations can be employed, gene therapy is "A" and the secondary agent, 
such as radio- or chemotherapy, is "B": 

A/B/A B/A/B B/B/A A/A/B A/B/B B/A/A A/B/B/B B/A/B/B 

15 B/B/B/A B/B/A/B A/A/B/B A/B/A/B A/B/B/A B/B/A/A 

B/A/B/A B/A/A/B A/A/A/B B/A/A/A A/B/A/A A/A/B/A 

Administration of the therapeutic expression constructs of the present invention to a 
20 patient will follow general protocols for the administration of chemotherapeutics, taking 
into account the toxicity, if any, of the vector. It is expected that the treatment cycles 
would be repeated as necessary. It also is contemplated that various standard therapies, 
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as well as surgical intervention, can be applied in combination with the described 
hyperproliferative cell therapy. 



Inhibitors of Cellular Proliferation 

5 The tumor suppressor oncogenes function to inhibit excessive cellular 

proliferation. The inactivation of these genes destroys their inhibitory activity, resulting 
in unregulated proliferation. The tumor suppressors p53, pl6 and C-CAM are specific 
embodiments utilized in the present invention. Other genes that can be employed 
according to the present invention include Rb, APC, DCC, NF-1, NF-2, WT-1, MEN-I, 

10 MEN-H, zacl, p73, VHL, MMAC1 / PTEN, DBCCR-1, FCC, rsk-3, p27, P 27/pl6 
fusions, p2 l/p27 fusions, anti-thrombotic genes (e.g., COX- 1 , TFPI), PGS, Dp, E2F, ras, 
myc, neu, raf, erb, fins, trk, ret, gsp, hst, abl, El A, p300, genes involved in angiogenesis 
(e.g., VEGF, FGF, thrombospondin, BAI-1, GDATF, or their receptors) and MCC. 

1 5 Regulators of Programmed Cell Death 

Apoptosis, or programmed cell death, is an essential process for normal 
embryonic development, maintaining homeostasis in adult tissues, and suppressing 
carcinogenesis (Kerr et al., 1972). The Bcl-2 family of proteins and ICE-like proteases 
have been demonstrated to be important regulators and effectors of apoptosis in other 
20 systems. The Bcl-2 protein, discovered in association with follicular lymphoma, plays 
a prominent role in controlling apoptosis and enhancing cell survival in response to 
diverse apoptotic stimuli (Bakhshi et al., 1985; Cleary and Sklar, 1985; Cleary et al., 
1986;Tsujimotoetal., 1985; Tsujimoto and Croce, 1986). The evolutionarily conserved 
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Bcl-2 protein now is recognized to be a member of a family of related proteins, which can 
be categorized as death agonists or death antagonists. Different family members have 
been shown to either possess similar functions to Bcl-2 (e.g., BclXL, BcIW, BclS, Mcl-1 , 
Al , Bfl-1) or counteract Bcl-2 function and promote cell death (e.g., Bax, Bak, Bik, Bim, 
5 Bid, Bad, Harakiri). 



Other agents 

It is contemplated that other agents can be used in combination with the present 
10 invention to improve the therapeutic efficacy of treatment. These additional agents 
include immunomodulatory agents, agents that affect the upregulation of cell surface 
receptors and GAP junctions, cytostatic and differentiation agents, inhibitors of cell 
adehesion, or agents that increase the sensitivity of the hyperproliferative cells to 
apoptotic inducers. 

15 

Dosage and Formulation 

The nucleic acid sequences and amino acid seqeunces (active ingredients) of this 
invention can be formulated and administered to treat a variety of disease states by any 
means that produces contact of the active ingredient with the agent's site of action in the 
20 body of an animal. They can be administered by any conventional means available for 
use in conjunction with pharmaceuticals, either as individual therapeutic active 
ingredients or in a combination of therapeutic active ingredients. They can be 
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administered alone, or with a pharmaceutically acceptable carrier selected on the basis 
of the chosen route of administration and standard pharmaceutical practice. 

The dosage administered will be a therapeutically effective amount of active 
ingredient and will, of course, vary depending upon known factors such as the 
5 pharmacodynamic characteristics of the particular active ingredient and its mode and 
route of administration; age, sex, health and weight of the recipient; nature and extent of 
symptoms; kind of concurrent treatment, frequency of treatment and the effect desired. 

The active ingredient can be administered orally in solid dosage forms such as 
capsules, tablets and powders, or in liquid dosage forms such as elixirs, syrups, emulsions 

10 and suspensions. The active ingredient can also be formulated for administration 
parenterally by inj ection, rapid infusion, nasopharyngeal absorption or dermoabsorption. 
The agent can be administered intramuscularly, intravenously, or as a suppository. In 
addition, parenteral solutions can contain preservatives such as benzalkonium chloride, 
methyl- or propylparaben and chlorobutanol. Suitable pharmaceutical carriers are 

1 5 described in Remington *s Pharmaceutical Sciences, & standard reference text in this field- 
Additionally, standard pharmaceutical methods can be employed to control the 
duration of action. These are well known in the art and include control release 
preparations and can include appropriate macromolecules, for example polymers, 
polyesters, polyamino acids, polyvinyl, pyrolidone, ethylenevinylacetate, methyl 

20 cellulose, carboxymethyl cellulose or protamine sulfate. The concentration of 
macromolecules as well as the methods of incorporation can be adjusted in order to 
control release. Additionally, the agent can be incorporated into particles of polymeric 
materials such as polyesters, polyamino acids, hydrogels, poly (lactic acid) or 
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ethylenevinylacetate copolymers. In addition to being incorporated, these agents can also 
be used to trap the compound in microcapsules. 

Useful pharmaceutical dosage forms for administration of the compounds of this 
invention can be illustrated as follows. 
5 Capsules: Capsules are prepared by filling standard two-piece hard gelatin 

capsulates each with a therapeutically effective amount of powdered active ingredient, 
175 milligrams of lactose, 24 milligrams of talc and 6 milligrams magnesium stearate. 

Soft Gelatin Capsules: A mixture of active ingredient in soybean oil is prepared 
and injected by means of a positive displacement pump into gelatin to form soft gelatin 
10 capsules containing a therapeutically effective amount of the active ingredient. The 
capsules are then washed and dried. 

Tablets: Tablets are prepared by conventional procedures so that the dosage unit 
is a therapeutically effective amount of active ingredient 0.2 milligrams of colloidal 
silicon dioxide, 5 milligrams of magnesium stearate, 275 milligrams of microcrystalline 
15 cellulose, 11 milligrams of cornstarch and 98.8 milligrams of lactose. Appropriate 
coatings can be applied to increase palatability or to delay absorption. 

Injectable: A parenteral composition suitable for administration by injection is 
prepared by stirring L5% by weight of active ingredients in 10% by volume propylene 
glycol and water. The solution is made isotonic with sodium chloride and sterilized. 
20 Suspension: An aqueous suspension is prepared for oral administration so that 

each 5 millimeters contain a therapeutically effective amount of finely divided active 
ingredient, 200 milligrams of sodium carboxymethyl cellulose, 5 milligrams of sodium 
benzoate, 1.0 grams of sorbitol solution ILS.P. and 0.025 millimeters of vanillin. 
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Accordingly, the pharmaceutical composition of the present invention can be 
delivered via various routes and to various sites in an animal body to achieve a particular 
effect (see, e.g., Rosenfeld et al. (1991), supra; Rosenfeld et al., Clin. Res., 39(2), 311 A 
(1991a); Jaffe et al., supra; Berkner, supra). One skilled in the art will recognize that 
5 although more than one route can be used for administration, a particular route can 
provide a more immediate and more effective reaction than another route. Local or 
systemic delivery can be accomplished by administration comprising application or 
instillation of the formulation into body cavities, inhalation or insufflation of an aerosol, 
or by parenteral introduction, comprising intramuscular, intravenous, peritoneal, 

1 0 subcutaneous, intradermal, as well as topical administration. 

The composition of the present invention can be provided in unit dosage form 
wherein each dosage unit, e.g., a teaspoonful, tablet, solution, or suppository, contains 
a predetermined amount of the composition, alone or in appropriate combination with 
other active agents. The tercn "unit dosage form" as used herein refers to physically 

15 discrete units suitable as unitary dosages for human and animal subjects, each unit 
containing a predetermined quantity of the compositions of the present invention, alone 
or in combination with other active agents, calculated in an amount sufficient to produce 
the desired effect, in association with a pharmaceutically acceptable diluent, carrier, or 
vehicle, where appropriate. The specifications for the unit dosage forms of the present 

20 invention depend on the particular effect to be achieved and the particular 
pharmacodynamics associated with the pharmaceutical composition in the particular host. 

These methods described herein are by no means all-inclusive, and further 
methods to suit the specific application will be apparent to the ordinary skilled artisan. 
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Moreover, the effective amount of the compositions can be further approximated through 
analogy to compounds known to exert the desired effect. 



5 The following examples are offered by way of example, and are not intended to 

limit the scope of the invention in any manner. 

EXAMPLE 1 
Mouse atonal homolog 1 {Mathl) 

10 It has been found that the present methods for the treatment of the hearing 

impaired have failed to address the problem directly, that is, the regeneration of auditory 
hair cell populations. The present invention in a preferred embodiment is directed to a 
member of the bHLH family, the Mathl gene or an another atonal-associated nucleic 
acid sequence, and its requirement for generation of cerebellar granule neurons and inner 

15 ear hair cells. This discovery has wide ramifications not only for understanding 
neurodevelopmentbut also for therapies for a variety of prevalent disorders, as described 
below. 

The mouse atonal homolog 1 (Mathl) is expressed in the precursors of the 
cerebellar granule neurons; a few cells in the dorsal portion of the developing spinal cord; 
20 the inner ear; Merkel cells (touch receptors on the skins); and joints. Overexpressing 
Mathl in an otherwise differentiated cell can induce the formation or differentiation into 
a progenitor or mature inner ear hair cell-like cell. 
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Mathl expression in the precursors of the cerebellar granule neurons suggests it 
is required for function in the cerebellum and brain. The cerebellum is essential for fine 
motor coordination and posture, and its dysfunction disrupts balance, speech and limb 
movements. Cerebellar development typically begins at about embryonic day 9.5 (E9.5) 
5 when a small group of cells in the hindbrain proliferates and migrates rostrally to form 
the external granule layer, brain stem, and pontine neurons. This population of neuronal 
progenitors, which continues to express Mathl^ further proliferates and migrates 
internally to form the cerebellar granule neurons that are the predominant neuronal 
population in the cerebellum and brain. Mice that do not express Mathl completely lack 

10 cerebellar granule neurons and their precursors. Mathl is thus essential for the 
generation of these neurons and endows the very sparse population of neurons at E9.5 
with the ability to proliferate into billions and then differentiate (Ben-Arie et al., 1997). 
Both these functions are of great medical significance. To understand normal 
proliferation provides necessary insight into abnormal proliferation, as observed in 

1 5 cancer. Cerebellar tumors of the primitive neuroectodermal type (e.g. , medulloblastoma) 
are the most common solid malignancy in children. Mathl expressing cells contribute 
significantly to these tumors. 

Mathl is expressed in the non-ossified joint cartilage (see Figure 6) that typically 
degenerates in osteoarthritis. This is the most prevalent form of arthritis, with 90% of 

20 people over 40 showing some degree of osteoarthritis in one or more joints. Given the 
properties of Mathl in cellular generation and proliferation, its artificial expression in 
affected joints can allow regeneration of the cells that constitute non-ossified cartilage. 
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Disclosed herein are compositions and methods for the use of the Mathl gene, its 
human homolog (Hath J) or any of its homologs, orthologs, chimeric fusion proteins or 
derivatives of any suitable atonal-associated nucleic acid sequence or any another atonal- 
associated nucleic acid sequence. To leani about the functions of Mathl in mammals, 
5 the Mathl gene was deleted from a mouse using a strategy that permitted detection of 
cells that express Mathl. Disclosed are the creation and characterization of mice that 
can be used to screen for compounds which could be utilized to decrease or augment 
Mathl expression in inner ear hair cells and other cells in which Mathl expression is 
associated. 

10 Methods are also disclosed for the study, characterization and treatment of 

neoplastic proliferation of neuroectodermal origin since Mathl expression is essential for 
the generation and proliferation of cerebellar granule neurons. Also, it has been 
discovered that Mathl plays a role in the development of cells that produce non-ossified 
joint cartilage, which are associated with the development of osteoarthitis. These 

1 5 discoveries have led to a method of screening for compounds that can be helpful for the 
treatment of inner ear hair cell loss and other diseases that occur due to the functional loss 
of Mathl, such as osteoarthritis. 

More particularly, the present invention provides an animal heterozygous for 
Mathl gene inactivation or an another atonal-associated nucleic acid sequence, wherein 

20 at least one Mathl allele or another ato«a/-associated nucleic acid sequence has been 
replaced by insertion of a heterologous nucleic acid sequence, wherein the inactivation 
of the Mathl or atonal-associated sequence prevents expression of the Mathl or atonal- 
associated allele. The mouse can be further used to generate mice homozygous for 
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Mathl or another atonaZ-associated sequence gene inactivation and can further include 
a second heterologous nucleic acid sequence, wherein at least one of the heterologous 
genes is used to detect expression driven by the Mathl or atowaZ-associated sequence 
regulatory elements. The complete or partial inactivation of the functional Mathl or 
5 a*cmtf/-associated sequence can be detected in, e.g., proprioreceptory cells, granule 
neurons and their progenitor cells, or non-ossified cartilage cells. 

Examples of heterologous nucleic acid sequences are reporter sequences such as 
b-galactosidase, green fluorescent protein (GFP), blue fluorescent protein (BFP), 
neomycin, kanamycin, luciferase, b-glucuronidase and chloramphenicol transferase 
1 0 (CAT). The Mathl or afowaZ-associated sequence can also be replaced under the control 
of regulatable promoter sequences or can be a tissue-specific promoter sequences. Said 
promoter sequences can be partial or can contain the entire promoter. 

The present invention can also be used as, or as part of, a method for screening 
for a compound, wherein the administration of the compound affects a developmental 
1 5 and/or pathological condition wherein said condition is a result of reduction in expression 
of the Mathl or atonaZ-associated sequence, the method including, administering the 
compound to a transgenic mouse that is homozygous for Mathl or atowaZ-associated 
sequence inactivation, wherein at least one Mathl or flfoHaZ-associated allele is 
inactivated by insertion of a heterologous nucleic acid sequence, wherein the inactivation 
20 of the Mathl or atowaZ-associated sequence prevents expression of the Mathl or atonal- 
associated gene, and monitoring the mouse for a change in the developmental and/or 
pathological condition. The types of pathological conditions that can be examined 
include, but are not limited to loss of hair cells, loss of cerebellar granule neurons or their 
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precursors, lack of granule cell proliferation or migration, lack of cerebellar external 
granule layer cells, hearing impairment, an imbalance disorder, joint disease, 
osteoarthritis, abnormal proliferation of neoplastic neuroectodermal cells and formation 
of medulloblastoma. As used herein, the screen provides for a compound that by 
5 upregulating expression of a heterologous nucleic acid sequence is apositive effector and 
for a compound that by downregulating expression of a heterologous nucleic acid 
sequence is a negative effector. 

Yet another embodiment of the present invention is a method of promoting 
mechanoreceptive cell growth, that includes contacting a cell with a Mathl or atonal- 
10 associated protein or gene in an amount effective to cause said cell to express an inner 
ear hair cell marker. An example of a hair cell marker for use with the method is 
calretinin. The cell can be contacted with a vector that expresses a Mathl or atonal- 
associated nucleic acid sequence or amino acid sequence. Mathl or <zto«a/-associated 
nucleic acid sequence-expressing recombinant vectors can include an adenoviral vector, 
15 a retroviral vector, an adeno-associated vector, a plasmid, a liposome, a protein, a lipid, 
a carbohydrate and a combination thereof of said vectors. Mathl or ato«a/-associated 
sequence can be under the control of, e.g., a cytomegalovirus IE promoter sequence or 
the cytomegalovirus IE promoter sequence and a SV40 early polyadenylation signal 
sequence, or any other combination of appropriate promoter sequences, enhancer 
20 sequence, and polyadenylation. 

Furthermore, a method is disclosed for treating hearing impairment or an 
imbalance disorder that includes administering to an animal, including a human, with 
hearing loss or an imbalance disorder a therapeutically effective amount of a Mathl or 



71 



WO 00/73764 PCT/USOO/15410 

<zrona/-associated amino acid sequence or nucleic acid sequence. The hearing or balance 
impairment can be complete or partial and can affect either one ear or both ears. In a 
preferred embodiment, there is a substantial impairment of hearing. Hearing and an 
imbalance disorder can be affected separately or concomitantly in an animal to be 
5 treated, and said hearing and/or an imbalance disorder could be as a result of trauma, 
disease, age-related condition, or could be due to loss of hair cells for any reason. 

The present invention is also directed to a composition that includes a Mathl or 
atowaZ-associated protein or gene in combination with a delivery vehicle, wherein the 
delivery vehicle causes a therapeutically effective amount of Mathl or afowaZ-associated 
10 sequence to be delivered into a cell. The delivery vehicle can be further defined as a 
vector that comprises a Math 1 or afonaZ-associated amino acid sequence or nucleic acid 
sequence in an animal cell. The vector can be a retroviral or an adenoviral vector or any 
other nucleic acid based vector which can even be dispersed in a pharmacologically 
acceptable formulation, and used for intralesional administration. The composition can 
1 5 even be a partially or fully purified protein that is delivered using a liposome, a protein, 
a lipid or a carbohydrate that promotes the entry of aMathl or atowaZ-associated protein 
into a cell. Examples of proteins that can be used as delivery vehicles include the 
receptor-binding domains (the non-catalytic regions) of bacterial toxins, such as, e.g., 
Exotoxin A, cholera toxin and Ricin toxin or protein transduction domains, such as from 
20 the HIV TAT protein (Schwarze et al., 1999) (see Example 22). The composition for 
delivering Mathl can be a fusion protein. 
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A skilled artisan is aware that methods to treat animals as disclosed in the 
invention can be either in utero or after birth. Treatment can be given to an embryo and 
can occur either ex vivo or in vivo. 

5 EXAMPLE 2 

Animal Model for Organogenesis 

An effective animal model for deficiency in a gene that controls organogenesis 
will most often have both alleles stably inactivated so that, throughout embryogenesis, 
one or more tissues cannot revert to a functional wild-type allele. One method of 

10 generating animals with an altered genotype is gene targeting (Mansour et al., 1993), in 
which homologous recombination of newly introduced DNA sequence (i.e., the targeting 
sequence or construct) and a specific targeted DNA sequence residing in the chromosome 
results in the insertion of a portion of the newly introduced DNA sequence into the 
targeted chromosomal DNA sequence. This method is capable of generating animals of 

15 any desired genotype, and is especially useful for gene disruption (i.e., to "knock out") 
at a specific chromosomal gene sequence by inserting a selectable marker into the gene 
or completely replacing the gene with another nucleotide sequence. 

To knock out a genomic sequence, a cloned fragment must be available and 
intron-exon boundaries within the fragment defined (Mansour et al., 1993). Typically, 

20 the targeting construct contains a selectable marker such as Neo (neomycin resistance, 
see Mansour et al., 1993) flanked by sequences homologous to the chromosomal target 
DNA, and beyond one of these flanking sequences the herpes simplex virus thymidine 
kinase gene (HSV-TK, see generally, McKnight et al., 1980). The targeting construct is 
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introduced, e.g., by electroporation, into embryo-derived stem (ES) cells where 
homologous recombination results in an insertion of the Neomycin resistance marker 
(Neo), but not the HS V-TK gene, into the targeted chromosomal DNA sequence. The 
altered ES cells are neomycin resistant and HSV-TK" and so are able to grow in the 
5 presence of both G418 and gancyclovir antibiotics. Random insertions contain the 
HS V-TK gene and are thus sensitive to gancyclovir (Mansour, et al.). Positive ES clones 
are then microinjected into blastocysts to generate germ-line chimeric mice, which are 
then bred to obtain progeny that are homozygous for the knock out gene. Such general 
methods of generating knock out animals have been demonstrated using mice. Genes in 
1 0 other animals such as rats, guinea pigs, gerbils, hamsters, and rabbits, canalso be used as 
long as sufficient DNA sequence data are available to make an appropriate targeting 
construct to knock out the gene of interest. 



Although ato and Mathl share a high degree of sequence conservation, there was 
15 an apparent discrepancy between their expression patterns and the consequences of their 
loss of function. Whereas ato is expressed primarily in the PNS of the fly and its absence 
causes loss of almost all CHOs (Jarman et al., 1 993), Mathl is expressed in the CNS and 
its loss leads to absence of cerebellar granule neurons, the largest neuronal population in 
the CNS (Ben-Arie et al., 1997). To better understand the functional relations between 
20 ato and Mathl , the present invention describes generation of a second Mathl null allele 
in mice (Mathl^^^) by replacement of UxoMathl coding region with a b-galactosidase 
gene (lacZ) and performing a subsequent search for CNS expression of ato in the fruit 
fly. The Examples describe a functional link between ato and Mathl: ato is expressed 
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in the fly brain, and lacZ expression under the control of Math J regulatory elements 
(Mathl/lacZ) not only replicated the known expression pattern in the CNS (i.e., the 
neural tube, spinal cord and cerebellum), but appeared in many other cells of the murine 
PNS. Overexpression of Mathl mDrosophila caused ectopic CHO formation, providing 

5 further evidence that ato and Mathl are functionally conserved. 

The connections and consistency of the relationship between atonal mDrosophila 
and Mathl in the mouse suggests that their use as model systems in the art is justified. 
A family of homologues have been cloned and analyzed in the mouse including 
MATH1,2,3,4A,4B,4C and5(Azakawaetal. 5 1995; Bartholoma and Nave, 1994;Ben- 

10 Arie et al., 1997; Ben-Arie et al, 1996, Fode et al., 1998; Ma et al., 1998; McCormick 
et al. ? 1996; Shimizu et al., 1995; Takebayashi et al., 1997). AXenopus atonal homolog, 
Xathl has been ectopically expressed in Drosophila and shown to behave similarly to ato 
(Kim et al., 1997). Furthermore, the ability of Mathl to induce ectopic CHO formation 
and to restore CHOs to ato mutant embryos (see Example 13) is strong evidence that 

15 Mathl, andparticularly its basic domain, encodes lineage identity information not unlike 
that encoded by ato and that mammalian cells expressing Mathl are functionally similar 
and perhaps evolutionarily related to Drosophila cells that require ato. Thus, the 
similarities between atonal in Drosophila, Xathl in Xenopus and Mathl in the mouse 
indicate that these animals are comparable animal model systems. Furthermore, the 

20 widespread use of mice in particular as a model system for humans also suggests that it 
similarly would allow utilization of the invention in humans. 

With advances in molecular genetics now standard in the art, sequences from 
humans and other species can be used interchangeably in a variety of organisms. For 
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example, the rat inducible hsp70 gene was used to produce transgenic mice that 
overexpressed inducible hsp70> allowing organs from transgenic mice to be protected 
from ischemic injury (Marber et al. J. clin. Invest, 95:1446-1456 (1995)) due to the 
increase in rat hsp70. Sequences in other animals have been interchanged including 

5 between humans and rodents to develop rodent models to study human disease, i.e. 
neurodegenerative diseases. One such example is the expression of the human SCA1 
gene, which encodes ataxin-1, in mice (Burright, E. N. et al Cell 82:937-948 (1995)), 
Transgenic mice were generated expressing the human SCA1 gene with either a normal 
or an expanded CAG tract. The data illustrated that the expanded CAG repeats were 

1 0 expressed in sufficient amounts in the Purkinje cells to produce degeneration and ataxia. 
This example illustrates that a mouse model can be established to study spinocerebellar 
ataxia type 1 , which is an autosomal dominant inherited neurologic disorder. In addition 
to developing mouse models, Drosophila is a hallmark model system in the field. 
Warrick et al. (1 999) produced transgenic flies which co-expressed human hsp70 and a 

15 human mutant polyglutamine (MJDtr-Q78). Expression of the human mutant 
polyglutmnnG MJDtr-Q78 alone in the flies resulted in the formation of large aggregates 
in neurons. However, co-expression with human hsp 70 resulted in suppressed 
aggregation. These examples illustrate that interchangeability of genes is routine in the 
field of molecular genetics and model systems provide powerful tools to characterize 

20 gene function. 

EXAMPLE 3 
Generation of Transgenic Math 1 Mice 
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To detect subtle Mathl expression patterns not identified by RNA in situ 
hybridization, and thus further illuminate this gene's role during embryonic development, 
Mathl null alleles (Mathl^ GaI/ ^ Gal ) were generated by replacing the Mathl coding region 
with p-galactosidase ((5-Gal). 

5 The targeting construct, containing a lacZ cassette and a PGK-neo cassette (Fig. 

7 A), was used to replace the Mathl coding region. To delete the entire coding region of 
Mathl, a targeting construct was generated that contained the 5' and 3' genomic flanking 
fragments as described previously (Ben-Arie et aL, 1997) flanking a pSAbgal/PGK-neo 
cassette (Friedrich and Soriano, 1991). The construct is designed so that lacZ expression 

10 is driven by endogenous Mathl control elements, while an independent PGK promoter 
drives the expression of the selectable marker neo. 

The construct was electroporated into ES cells and selection for neo was achieved 
with G418. Fourteen out of 76 (18%) clones underwent homologous recombination. 
Genotyping of ES cells, yolk sac and tail DNA was performed using Southern analysis 

15 of EcoR I digested DNA and probes previously described (Ben-Arie et aL, 1997). The 
targeting construct was electroporated into embryonic stem (ES) cells; 14/76 (18%) 
clones exhibited correct homologous recombination at the Mathl locus (Fig. 7B). 

Three ES cell lines carrying the Mathl* 1 *-®* allele were injected into host 
blastocysts to generate chimeric mice. Mathl +/b ' ml mice were identified and intercrossed 

20 to generate homozygotes (Fig. 7C). The Mathl deletion was confirmed by Southern 
analysis using both flanking and internal probes (Fig. 7A). 

Mathl^ 0 ^ 001 rmce show all the phenotypic features reported in thoMathl~ A mice 
(Ben-Arie et aL, 1997; 2000). 
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EXAMPLE 4 

X-gal staining, histological and immunohistocheinical analyses 

Embryos were staged by vaginal plug, with the morning of the plug designated 
E0.5. Embryos were dissected out of the uterus, separated from extraembryonic 

5 membranes, and placed in cold phosphate buffered saline (PBS). The embryos were then 
fixed in 4% paraformaldehyde (PFA) in PBS for 30 minutes, and washed in cold PBS. 
Yolk sacs or tails were collected before fixation for DNA extraction and genotyping. 
Equilibration to improve the penetrability of the staining reagents was performed in 
0.02% NP40, 0.01% sodium deoxycholate in PBS for 10 minutes at room temperature. 

1 0 Whole mount staining with X-gal (Bonnerot and Nicolas, 1 993) was performed for 1 6-24 
hours at 30oC while shaking in the same equilibration buffer, which also contained 5mM 
potassium ferricyanide, 5mM potassium ferrocyanide, and 40mg/ml X-gal (dissolved in 
DMSO). When the desired intensity of staining was achieved, usually within 18 hours, 
embryos were washed in PBS, postfixed for 30 minutes in buffered formalin, serially 

1 5 dehydrated in 25, 50, and 70% ethanol, and stored at 4°C. 

For histological analysis embryos were further dehydrated in 80, 90, and 100% 
ethanol, treated in Histoclear (National Diagnostics), and embedded in Paraplast (Oxford 
Labware). Seven to 20 \im sections were cut using in a microtome (Microme). 
Counterstaining was performed using nuclear fast red (Vector Laboratories). 

20 Immunohistochemistry was performed as detailed previously (Ben-Arie et aL, 1997). 
Antibodies: Anti-cytokeratin 18 (DAKO) 1:20; Anti-human Chromogranin A (DAKO) 
1:100; Anti-MATHl (see below) 1:200. 
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EXAMPLE 5 
Expression Patterns in Transgenic Mathl Mice 

As expected, ji-Gal expression in the cerebellum and dorsal spinal cord is 
identical to that of Mathl , and interestingly, P-Gal is also expressed throughout the otic 
5 vesicle epithelia at El 2.5 and in the sensory epithelia of the utricle, saccule, semicircular 
canals, and cochlea at El 4.5 andE15.5 (Figures lAand IB). Utricles were obtained from 
C57BL/129SVEVmice. 

Gross morphological analysis of the inner ear of MathP' Gai/ ^' Gal mice at E 1 8 . 5 , one 
day before full gestation, revealed no obvious defects in overall structure and size 
10 compared with wild type (wt) littermates. The branches of the VIII th cranial nerve were 
present and reached the epithelia, but degenerated due to absence of the hair cells. 

The sensory epithelia were examined in detail. The utricles and cochleas of wild- 
type, Mathl +/ ^ Ga \ and Mathl^' Gal/ ^ Gal micG were excised to allow viewing of the sensory 
epithelia with Nomarski optics. Hair bundles were present in both organs of wild-type 
15 and heterozygotes, but were completely absent in Mathl null litter-mates. Scanning 
electron microscopy (SEM) of the cochlea and vestibular organs confirmed the absence 
of hair bundles in null mice (Figures 2A through 2F). To determine whether lack of hair 
bundles reflects the absence of hair cells, cross-sections of the sensory epithelia of all 
inner ear organs using both light and transmission electron microscopy (LM and TEM, 
20 respectively) were examined (Figures 3A through 3F). LM and TEM were carried out 
as described previously (Lysakowski and Goldberg, 1997). Tissue preparation for SEM 
consisted of osmication (1% Os0 4 in cacodylate buffer), dehydration, critical-point 
drying, sputter-coating with gold, and examination in a JEOL 35S electron microscope. 
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Light microscopy revealed that sensory epithelia in null mice are considerably 
thinner, lack the normal stratification of cell nuclei and stain uniformly, all of which are 
consistent with the absence of hair cells. TEM clearly distinguishes between hair cells 
and supporting cells in normal utricles: hair cells have hair bundles, less electron-dense 
5 cytoplasm, more apical nuclei, and no secretory granules (Figures 4A and 4B). The 
sensory epithelia of the null mutants lack hair cells entirely but do have supporting cells 
with normal appearance (Rusch, et al.,1998), including electron-dense cytoplasm, basal 
nuclei, and secretory granules. However heterozygous Mathl +/l>Gai mice retainhair cells. 

10 EXAMPLE 6 

Expression of a Hair Cell Specific Marker in Transgenic Mathl Mice 
Lack of hair cells at El 8.5 can be due to (1) lack of sensory cell progenitors, (2) 
the inability of progenitors to differentiate into hair cells, or (3) the inability of hair cells 
to maintain the differential states, as has been observed in the absence of the POU 
1 5 domain transcription factor Brn3c. The first possibility is unlikely because progenitors 
give rise to both hair cells and supporting cells. To evaluate the remaining possibilities, 
the expression of the hair cell specific marker, calretinin and myosin VI were examined. 
Cahetinin is a member of the calcium binding family of proteins and is expressed in 
differentiating hair cells (prior to hair bundle formation) and mature inner ear and 
20 auditory hair cells, but not in supporting cells. Cahetinin expression in Mathl^ CaV ^ al 
and wild-type mice was studied by immunofiuorescense on coronal sections of E15.5, 
E16.5 and El 8.5 embryos (Figures 5 A through 5F). 
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For immunofluorescence, embryos were fixed for 1.5 hours in 4% 
paraformaldehyde/PBS at 4°C, sunk through 15% sucrose/PBS for 5 hours then 30% 
sucrose/PBS overnight, and snap frozen in a 2-methylbutane dry ice bath. 14 
sections were cut on a cryostat and mounted onto gelatin-coated slides. Sections were 

5 fixed onto slides by dipping for 10 minutes in Streck tissue fixative (Streck laboratories) 
and air drying. Sections were blocked in 30% normal goat serum and 0.3% triton X-100 
in PBS for 1 hr at room temperature (RT). Rabbit anti-calretinin polyclonal antibody 
(Chemicon laboratories) was diluted 1 :200 in blocking solution and incubated overnight 
on sections at 4°C. Sections were washed 3 times (20 minutes each) in Phosphate- 

10 Buffered Saline (PBS) at RT. The secondary antibody anti-rabbit antibody, Alexa 488 
(Molecular Probes), was diluted 1 :400 in blocking solution and used to detect calretinin. 
Sections were covered and incubated at RT for 2 hours before washing and mounting in 
Vectashield containing DAPI (Vector). For confocal microscopy, sections were treated 
with 25 jig/ml RNAse before counterstaining with 50 ng/ml of propidium iodide and 

1 5 mounted in Vectashield without DAPI. Stained sections were viewed under a Bio-Rad 
1024 confocal microscope. 

Calretinin-positive cells are clearly visible in the sensory epithelia of the 
semicircular canals and utricles of wild-type mice, but MaihP~ Gal/ ^ Gal embryos lack 
calretinin expression at all three states. Using the mouse model disclosed herein the 

20 present inventors demonstrate that hair cells never develop within the sensory epithelia 
oiMathl^ GaV ^ Gal mice. The presence of the tectorial and otolithic membranes (secreted 
in part by the supporting cells), together with the TEM results, suggests that the 
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remaining cells in the sensory epithelia of the Mathl^ Gai ^ Gal mice are functional 
supporting cells. 

EXAMPLE 7 

5 MathlAacZ expression mimics Mathl expression in the developing CNS 

The developing cerebellum at E14.5 and postnatal day 0 (P0) in Mathl**"* and 
Mathl b-s^-s 31 mice were analyzed by RNA in situ hybridization analysis. 

The analysis showed that the expression pattern of the lacZ gene faithfully 
reproduced the Mathl expression pattern observed by RNA in situ hybridization analysis 
10 shown previously (Akazawa et al., 1995; Ben-Arie et al., 1996) (Fig. 2A, B, E, G). 
Moreover, the cerebellar phenotype in Mathl "-^^ mice (Fig. 8F and 8H) was identical 
to that observed in Mathl null mice (Ben-Arie et al., 1997). At E14.5, the precursors of 
the EGL are present in the rhombic lip from which they migrate over the cerebellar 
anlage to populate the EGL (Fig. 8E). Mutant mice displayed far fewer of these cells 
1 5 than heterozygous mice (Fig. 8F). At PO, the neurons of the external granule layer (EGL) 

were completely lacking (Fig. 8H). 

MathlAacZ expression in the developing hind brain and spinal cord similarly 
reproduced the expression pattern of Mathl (Fig. 8 C, 8D). The only notable difference 
between the expression patterns established by in situ hybridization and lacZ staining is 
20 that b-galactosidase expression persists in differentiating or migrating cells of the spinal 
cord because of the stability of the b-GAL protein (Fig. 8D). In summary, the neural 
tissue expressionpattern and cerebellar phenotype associated with the replacement of the 
Mathl coding region by lacZ is consistent with previously published data on Mathl 
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expression (Akazawaetal., 1995;Ben-Arieetal., 1997;Ben-Arieetal., 1996; Helms and 
Johnson, 1 998), demonstrating that the endogenous control elements were not disrupted 
by insertion of the lacZ gene. Moreover, many previously undetected clusters of 
/acZ-expressing cells became apparent upon X-gal staining of whole embryos and 
5 sections in Matht^* mice (see below). It is likely that limitations in the spatial 
resolution ofRNA in situ hybridization techniques used to detect the transcript in earlier 
studies prevented these sites of expression from being discerned (Akazawa et al., 1995; 
Ben-Arie et al., 1996). Alternatively, the stability of the lacZ gene product and the 
increased sensitivity due to signal amplification allowed us to identify sites of relatively 
10 low expression levels. 



EXAMPLE 8 
Mathl/lacZ is expressed in inner ear sensory epithelia 

The sensory organs of the inner ear were among the newly identified sites of 
15 Mathl/lacZ expression, demonstrated utilizing the methods described in Example 2. 
Expression in the otic vesicle was first detected at E12.5 and continued until El 8.5 
throughout much of the sensory epithelia (Bermingham et al., 1999) (Figure 9A, 9B). 
NullmutantsdisplayedMa^MacZexpressionintheinner ear throughout embryogenesis 
(Fig. 9C). Math! null mutants lack hair cells in all of the sensory organs (Bermingham 
20 et al., 1999), but maintain supporting cells, the other sensory epithelia-derived cells (Fig. 
9C). These supporting cells seem to be functional, based on their morphology and the 
presence of overlying membranes secreted in part by these cells. Although the 
expression of Mathl in inner ear sensory epithelia was not demonstrated by RNA in situ 
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hybridization analysis, the complete lack of inner ear hair cells in the null mutants leaves 
little doubt about the authenticity of the MaihlflacZ expression pattern. 

Maihl is clearly essential for hair cell development in the inner ear. Its 
expression pattern and in vivo function are akin to those of Mathl 's proneural homolog, 
5 atonal (ato) (A. P. Jarman, Y. Grau, L. Y. Jan, Y. N. Jan, Cell 73, 1307-21 (1994)). ato 
is expressed in a ring of epithelial cells within the antennal disc of Drosophila. Some of 
these epithelial cells will subsequently develop into mechanoreceptors in the Johnston 
organ, which is necessary for hearing and negative geotaxis. It is interesting to note that 
mechanoreceptor progenitor cells are absent in ato mutants, whereas only the 
10 mechanoreceptors, and not their progenitors, are absent in Mathl null mice. 

Based on the observations made herein, the present inventors have recognized that 
Mathl is required for the specification of inner ear hair cells. In a sense, Mathl acts as 
a "pro-hair cell gene" in the developing sensory epithelia. In conjunction with two recent 
studies, the present inventors have recognized that the results provided herein provide 
15 evidence supporting a lateral inhibition model for the determination of hair cells and 
supporting cells (Haddon et al., 1998; Adam, et al., 1998), in which the interplay of 
Delta, Notch, and Serratel results in the selection of individual hair cells from clusters 
of competent cells. Such a model entails that the sensory epithelia express a "pro-hair 
cell gene" whose function is essential for hair cell fate specification. 
20 The ectopic expression of ato in the fruitfly and its homolog Xathl in Xenopus 

(Kim et al.,1997) can recruit epithelial cells into specific neuronal fates, and the 
expression of Mathl in inner ear epithelia strongly suggests loss of a functional Mathl 
gene is likely to be a common cause of deafness and vestibular dysfunction. 



84 



WO 00/73764 PCT/US00/1 5410 

EXAMPLE 9 
MathlAacZ is expressed in brain stem nuclei 

In the brainstem Mathl/lacZ staining appeared from El 8.5 to P7 in the ventral 
pons in the regions corresponding to the pontine nuclei (Fig. 9D and inset). This finding 

5 is consistent with the hypothesis of Akazawa and colleagues that Mathl -positive cells in 
the developing hind brain are precursors to the bulbopontine neurons (Akazawa et al., 
1995), No such staining appeared in the null mutants (Fig. 9E and inset). These data 
raise the possibility that the absence of lacZ staining in pontine nuclei can be due to 
failure of their precursors to migrate, proliferate, and/or differentiate. Ventral pontine 

1 0 nuclei were examined upon haematoxylin and eosin staining of sections and were found 
to be missing in the brain stem of null mice (Fig. 9F, G). Furthermore, the failure of null 
mouse newborns to breathe can be due to absence of these brainstem neurons. 

EXAMPLE 10 

15 MathlAacZ is expressed in chondrocytes 

Mathl +/b - GaI heterozygotes displayed expression of Mathl in articular cartilage 
(Figures 6A and 6B). Figure 6A demonstrates expression in all joints of a forelimb. 
Upon closer examination of an elbow joint, Mathl is noted to be expressed exclusively 
in the non-ossified articular chondrocytes. 
20 Expression of Mathl/lacZ was detected in the developing proximal joints, such 

as those of the hip and shoulder, as early as E12.5 (Fig 10A). X-gal positive staining was 
detected at subsequent developmental stages in a progressive proximal-distal pattern that 
paralleled the normal development of joints (Figure 10B). In the joints, Mathl/lacZ 
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expression immediately follows mesenchymal condensation, which begins at El 1.5. 
Condensed mesenchyme cells differentiate into chondrocytes (Bi et al. ? 1999; Horton et 
al., 1993;Karsenty, 1998). 

Chondrocytes differentiate in three major phases during bone formation: resting, 

5 proliferating and hypertrophic. The resting chondrocytes that populate the articular 
cartilage are referred to as articular chondrocytes (Buckwalter and Mankin, 1998; Poole, 
1997). Prior to birth, resting chondrocytes constitute the entire chondrocyte population 
in joints. To establish which cells expressed Mathl/lacZ, sections from El 8.5 and P7 
Mathl +/b ^ mice were stained with X-gal. Mathl/lacZ is expressed in the resting 

10 chondrocytes of all joints analyzed at E18.5; resting chondrocytes in the elbow joint are 
shown in Figure IOC, and Fig. 10D shows the resting, proliferating, and articular 
chondrocytes of a P7 mouse. 

Thejoints ofE18.5 embryos were examined with anti-MATHl antibody prepared 
by the following methods. An EcoR l-Hind IE fragment encoding the N-terminal 156 

15 amino acids of the Mathlopsn reading frame (MathlD) was cloned into the pET 28a+ 
expression vector (Novagen). MathlD fragment was expressed as a His tag fusion 
protein. Soluble MATH ID protein was purified according to His-tag kit specifications 
(Novagen) and 2mg of protein were used to immunize Chickens (Cocalico Biologicals 
Inc.). 

20 Expression was found in resting chondrocytes, whereas no expression was 

observed in null embryos. It should be noted that not all articular cartilage cells express 
Mathl/lacZ (Fig. 10E). Mathl/lacZ expression in Mathl null mutants is similar to that 
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in heterozygous mice at El 8.5, suggesting that Mathl is not required for resting 
chondrocyte development. 



EXAMPLE 11 

5 Mathl/lacZ is expressed in Merkel cells 

By El 4.5 Mathl /lacZ-positive cells were apparent around the vibrissae and in the 
skin of much of the body (Fig. 10B). In the trunk, the stained cells were arranged in a 
striped pattern defined by the epidermal ridges. This staining was apparent only in the 
hairy, not the glabrous, skin. All the primary (mystical) vibrissae, including the lateral 

10 nasal, maxillary and four large hairs, were positive for Mathl/lacZ. Staining was also 
detected in the secondary vibrissae, including the labial, submental, rhinal, and isolated 
orbital vibrissae (supra-, infra- and post-orbital) (Y amakado and Yohro, 1979). By E15.5 
staining appeared in clusters of cells in the foot pads (Fig. 10B). 

To identify the Mathl /lacZ-positivQ cells in the vibrissae, footpad, and hairy skin, 

15 we examined histological sections ftomMathr^®* mice (Fig. 1 1 A-D). Sections through 
the vibrissae showed that the stained cells are localized to the more apical half of the hair 
shaft, but are not in the hair itself. Cross sections through the foot pad illustrated staining 
of cluster of cells in the epidermal layer (Fig. 1 1 B,C). As shown in Fig. 1 ID, sections 
through the truncal skin identified clusters of Mathl VlacZ-stamed cells. The stained cells 

20 were arranged in a horseshoe-shaped pattern centered within an elevated button-like 
structure in the hairy skin. These button-like structures were identified as touch domes 
or Haarscheiben (Pinkus, 1905), which are characterized by a thickened epidermis and 
an elevated dermal papilla with a capillary network. Touch domes are associated with 
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large guard hairs dispersed between other hair types in the coat. The spatial distribution 
of Mathl/lacZ-stained cells, the timing of their appearance at E14.5, and their 
localization within the mystical pads of the vibrissae and the touch domes in the hairy 
skin suggest that these cells correspond to Merkel cells, specialized cells in the epidermis 
5 that form slow-adapting type I mechanoreceptor complexes with neurites (Munger, 
1991). 

The results of comparative analysis of the Mathl/lacZ expression pattern in 
heterozygous and homozygous El 6.5 animals are shown in Figure 1 1 E-L. Mathl ^s^s^ 
embryos displayed a staining pattern similar to that of Mathl^^ x littermates in the 

10 vibrissae and footpads (Fig. 1 1E-G, I-K). In contrast, staining in the touch domes of the 
hairy skin was barely detectable in Mathl ^e^s* embryos (Fig. 1 1H,L). The reduction 
of staining in null animals was also obvious at El 8.5. 

To further define Mathl VfacZ-positive cells in the skin, Mathl +/b ~^ } mice were 
mated to Tabby mice. Tabby (Ta) is a spontaneous X-linked mutation displaying a 

1 5 similar phenotype inhemizygous males andhomozygous females (Ferguson et al., 1 997). 
Tabby mutants lack hair follicles (tylotrich), a subset of Merkel cells that are associated 
with touch domes in the hairy skin of the trunk (Vielkind et aL, 1995), and some of the 
five secondary vibrissae on the head (Gruneberg, 1971). Hence, in a cross of Ta/Ta 
females with a heterozygous Mathl***** 1 male, 50% of the male progeny are Ta/Y: 

20 Mathl^^ allowing us to assess whether the Mathl /lacZ-positive cells correspond to 
Merkel cells. 
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Ta/Ta females were time-mated with Mathl**** males, and embryos were 
harvested at E16.5. Each pup's gender was determined by PCR on tail DNA, using 
primers (forward 5'-TGAAGCTTTTGGCTTTGAG-3'; SEQ ID NO:67, and reverse 
5'- CCGCTGCC AAATTCTTTGG-3' ; SEQ ID NO:68) that yielded a 320 bp product 
5 from chromosome X, and a 300 bp product from chromosome Y (Liu et al., 1999). 
Amplification conditions were: 92°C/1 min, 55°C/1 min, 72°C/1 min for 32 cycles, with 
an initial denaruration step of 94°C/7 min and last extension step of 72°C/7 rnin. 
Amplification products were separated on 2% agarose gels. X-gal-stained embryos were 
scored independently by 2 individuals, and only then were results matched with the 

10 determined gender. 

Both Tabby females and males carrying the MathF*^ allele displayed X-gal 
staining in the vibrissae and foot pads (Fig. 12A3)- The effect of the Tabby mutation 
on the number of secondary vibrissae was quite clear: hemizygous males completely 
lacked Mathl /lacZ-^osiiiwQ cells in the secondary vibrissae (typically lacking in 7a 
15 mutants) and on the trunk (Fig. 12E). Females that are heterozygous for Tabby showed 
patchy staining in the touch domes (although less than wt ), as should be anticipated in 
female carriers of a mutation in a gene that undergoes random X chromosome 
inactivation (Fig. 12C,D). The localization and distribution of the positive cells, as well 
as their absence in selected vibrissae and the trunk of Tabby males, strongly indicate that 
20 Mathl is expressed in the Merkel cells associated with guard follicles in the touch domes 
of the hairy skin. 

To ascertain whether Mathl/lacZ staining pattern reflects normal Mathl 
expressionpattern, immunohistochernical analysis of MATH 1 was performed on sections 
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from abdominal skin (see Example 2). As seen in Fig. 1 3 A and B, MATH 1 -positive cells 
were detected around the hair follicles of Mathl +/+ but not Math 1 ^s^s* 1 mice. Antibodies 
against two Merkel cells markers were chosen for further analysis: anti-cytokeratinl8, 
expressed in simple epithelia, and chromogranin, localized to secretory granules of 

5 neuroendocrine, endocrine, and neuronal tissues. Both cytokeratin 18 (Fig. 13C,D) and 
chromogranin A (Fig. 13E,F) confirmed the identity of the Ma?/zMacZ-positive cells as 
Merkel cells, but did not reveal staining abnormalities in Mathl^ 1 ^^ mice. Thus, 
Mathl does not seem to be essential for the genesis of the neuroendocrine Merkel cells, 
in contrast to pure neuronal cell types like cerebellar EGL and pontine nuclei. Because 

10 Mathl null mutants die at birth, we can not assess whether the entire cluster of Merkel 
cells is formed or the functional integrity of Merkel cells in these mutants is affected. 



EXAMPLE 12 
Mathl partially rescues Chinese Hamster 
15 Ovary Cells in flies deleted for ato 

This Example demonstrates that atonal-associated genes can induce the 
development of CNS cells in animals deficient in a native atonal-associated gene or gene 
product. This Example also demonstrates that atona/-associated genes can 
therapeutically function in species in which they are not natively expressed. 
20 Given the remarkable similarity in expression patterns of ato and Mathl, and their 

identical basic domains, Mathl was tested to see if it would mimic the effects of ato 
overexpression by producing ectopic chordotonal organs as described by the following 
methods. Wild-type, also known as yw flies, were transformed with a UAS-Mathl 
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construct as described (Brand and Perrimon, 1 993). To overexpress Mathl in wild type 
flies,jw; UAS-Mathl flies were mated to HS-Gal4 flies. Theprogeny were heat shocked 
as previously described (Jarman et al., 1993). To rescue the loss of chordotonal organs 
in ato mutant flies, w; UAS-Mathl /UAS-Mathl; atol/TM6 flies were crossed to w; 

5 HS-GaWCyO; atol/TM6 flies. Embryos were collected for 3 hr., aged for 3 hr., heat 
shocked for 30 min. at 37°C and allowed to develop for the next 12-1 5 hr. Embryos were 
fixed in 4% formaldehyde in PBS with 50% heptane. Embryos were washed with 1 00% 
ethanol, transferred to PBT and stained with mAb 22C10 as previously described (Kania 
etal., 1995) to detect PNS neurons. Chordotonal neurons were identified by their distinct 

1 0 morphology and position. 

Expressing Mathl during pupal development by heat shock using the UAS-Gal4 
system (Brand and Perrimon, 1993) resulted in supernumerary external sense organs on 
the notum (Fig. 14A,B) and the wing blade, as reported for ato (Jarman et aL, 1 993) and 
the Achaete-Scute complex (AS-C) genes (Brand and Perrimon, 1993; Rodriguez et al., 

15 1990). Mathl expression in flies, like ato, produced ectopic chordotonal organs (Fig. 
8G), although with less efficiency. Overexpression of thcAS-C genes does not, however, 
result in ectopic chordotonal organs (Jarman et al., 1993), Mathl thus has a similar 
functional specificity to ato. 

Since several ato enhancers are ato-dependent (Sun et al., 1998), they can be 

20 activated by Mathl, which would then lead to ectopic CHO specification. To determine 
whether Mathl can substitute for ato function in the fly, and to rule out the possibility 
that production of CHOs by Mathl is due to ato activation, Mathl was expressed in ato 
mutant embryos. The mutants lack all chordotonal neurons (Fig. 14C), but 
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overexpressing Math J partially rescues the loss of these neurons (Fig. 14D) in a manner 
similar to ato (Chien et al., 1996). 



EXAMPLE 13 

5 Significance of atonal and Mathl in the CNS and PNS 

Over the past few years significant progress has been made towards unraveling 
the roles ofbHLH proteins in vertebrate neurogenesis . Neural vertebrate bHLH-encoding 
genes were isolated and characterized because Drosophila homologues such as ato or the 

1 0 AS-C genes had been previously shown to be required for neurogenesis (Anderson, 1 995 ; 
Guillemot, 1046 1995; Lee, 1997; Takebayashi et al., 1997). Indeed, several genes were 
shown to be proneural because their absence caused a failure of neuroblast or sensory 
organprecursor (SOP) specification, whereas their overexpression lead to the recruitment 
of supernumerary neuronal precursors (Ghysen and Dambly-Chaudiere, 1 989). With the 

15 exception of neurogenin (Ngn) 1 and 2 (Fode et al., 1998; Ma et al., 1998), it remains 
uncertain which of the vertebrate homologues play roles similar to their Drosophila 
counterparts, and what precise role different bHLH proteins play in neural development. 
In Drosophila, ato is required for the development of a specific subset of sense organs, 
the chordotonal organs (Jarman et aL, 1993). CHOs are internal mechanosensors of the 

20 PNS (Mclver, 1985). Thus, ato and the CHOs provide an excellent system in which to 
ascertain not only the molecular and developmental relationship between invertebrate and 
vertebrate neurogenesis vis-a-vis the function of the proneural genes, but also the 
evolutionary conservation of sensory organ function and specification. Seven ato 
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homologues have been cloned and analyzed in the mouse: Mouse Atonal Homologues 
(MATH) 1, 2, 3 t 4A (also known as Ngn2\ 4B {Ngn3\ 4C (Ngnl), and 5 (Akazawa et al., 
1995; Bartholoma and Nave, 1994; Ben-Arie et al,, 1997, 1996; Fode et al., 1998; Ma et 
al., 1998; McCormick et al., 1996; Shimizu et al., 1995; Takebayashi et al., 1997). Most 

5 are expressed during neurogenesis in both the CNS and PNS. These homologues vary in 
the degree of their sequence conservation, and canbe divided into three groups. The most 
distantly related group, the neurogenins, includes Ngn 1, 2 and 3. These gene products 
share, on average, 53% identity in the bHLH domain with ATO. They are expressed 
largely in mitotic CNS and sensory ganglia progenitor cells. Recent work suggests that 

1 0 these genes canplay a role in neuroblast determination, and cantherefore be true proneural 
genes (Fode et al., 1998; Ma et al., 1998). The second group includes MATH2 and 
MATED, which share 57% identity in the bHLH domain with ATO. These proteins have 
been postulated to function in postmitotic neural cells (Bartholoma and Nave, 1994; 
Shimizu et al., 1995). Math2 expression is confined to the CNS, while Math3 is 

15 expressed in both the CNS and the trigeminal and dorsal root ganglia. The third group 
includes MATH1 and MATHS, which share 67% and 71% identity with the bHLH 
domain of ATO, respectively. It is noteworthy that both genes encode a basic domain 
identical to that of ATO. Interestingly, the basic domain of ATO was shown to be 
sufficient, in the context of another proneural protein (SCUTE), to substitute for the loss 

20 of ato function (Chien et al., 1996). Mathl was initially shown to be expressed in the 
precursors of the cerebellar EGL and in the dorsal spinal cord (Ben-Arie et al., 1997, 
1996). MathS is expressed in the dividing progenitors in the developing retina and in the 
vagal ganglion (Brown et al., 1998). With the exception of MathS expression in the 
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neural retina, these observations pose a paradox: none of the vertebrate homologs 
appeared to be expressed in peripheral organs or tissues similar to those where ato is 
expressed. Jarman et al. ( 1 993 ) reported that ato is expressed in the CNS . In the examples 
described herein it is shown that ? in addition to the inner proliferation center of the optic 

5 lobe, ato is expressed in a small anteriomedial patch of cells in each brain lobe (Fig. 8F). 
Because it remains unclear, however, precisely what role ato plays in Drosophila CNS 
development, it has been difficult to argue that ato and its vertebrate homologues display 
functional conservation. Our experiments reveal sites of previously uncharacterized 
Mathl expression. As expected, we found that Mathl/lacZ expression in the CNS 

10 corresponds to that of Mathl, but we also found that Mathl is expressed in the skin, the 
joints, and the inner ear, in striking parallel to ato expression in the fly. Moreover, the 
expression in the ear (sensory epithelium) and the skin (Merkel cells) is restricted to 
sensory structures whose function is to convert mechanical stimuli into neuronal 
electrochemical signals. It is important to point out that in Drosophila, ato appears to play 

15 two roles simultaneously. It is required not only to select the precursors of the CHOs 
(proneural role), but also to specify these precursors as CHO precursors (lineage identity 
role) (Jarman and Ahmed, 1998; Jarman et aL, 1993). The specificity of Mathl 
expression in the periphery makes it tempting to speculate that it, too, canendow specific 
cells with very specific lineage identities to distinguish them functionally from other 

20 sensory structures. The ability of Mathl to induce ectopic CHO formation and to restore 
CHOs to ato mutant embryos supports the notion that Mathl, and particularly its basic 
domain, encodes lineage identity information not unlike that encoded by ato. This 
suggests that the mammalian cells expressing Mathl, at least in the ear and the skin, are 
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functionally similar and perhaps evolutionarily related to Drosophila cells that require 
ato. Furthermore, Math5 expression in the neural retina suggests that the functions of 
atonal in the fly are carried out by two genes in the mouse: the development of some 
mechanoreceptors is under the control of Mathl and retinal development is possibly 
5 under the control of Math5. It is interesting to note that in the fully sequenced nematode 
C. elegans, only one homolog of atonal, lin-32, was identified (Zhao and Emmons, 1 995). 
Mutants with the u282 allele of lin-32 are touch-insensitive, which strengthens the 
argument for evolutionary conservation of atonal function in mechanoreception. The 
pattern of Mathl/lacZ expression in the pontine nuclei suggested this region should be 
10 carefully evaluated in null mutants. Although no defects in the pons of Mathl null mice 
(Ben-Arie et al., 1997) were originally detected, closer analysis revealed the lack of 
pontine nuclei at this site. These neurons derive from the rhombic lip (Airman and Bayer, 
1996) as do the EGL neurons, which are also lacking in Mathl null mice. While it is 
possible to draw parallels between Mathl and ato expression in the skin and ear, it is not 
15 clear that such is the case for the joints, ato expression in the fly joints is required for the 
formation of leg CHOs. In contrast, Mathl is expressed in resting and articular 
chondrocytes mat do not have any described neural function, and for which no parallels 
exist in the fly. It can be that Mathl expression in cartilage indicates a novel role for a 
mechanosensory gene, or it cansimply reflect similarities in the molecular events 
20 underlying the development of the various Mathl -expressing cell types. Alternatively, 
CHOs canalso function as joint structural elements in the fly, or articular cartilage 
canhave a mechanoreceptive or transducive capacity yet to be described. There is no 
evidence at this point to support one or another of these possibilities. Analyzing the 
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functions of ato and Mathl will enhance our understanding of neural development and 
the evolutionary conservation of sensory function. The sites and specificity of Mathl 
expression canmake it suitable as a tool of gene therapy or gene activation approaches 
to illnesses such as hearing loss and osteoarthritis that are due to age-related or 
5 environmental damage. 



EXAMPLE 14 

^4to«a/-Associated Nucleic Acid Delivery using Adenovirus 



1 0 Human adenoviruses are double-stranded DNA tumor viruses with genome sizes 

of approximate 36 kb. As a model system for eukaryotic gene expression, adenoviruses 
have been widely studied and well characterized, which makes them an attractive system 
for development of adenovirus as a gene transfer system. This group of viruses is easy 
to grow and manipulate and they exhibit a broad host range in vitro and in vivo. In 

15 lytically infected cells, adenoviruses are capable of shutting off host protein synthesis, 
directing cellular machineries to synthesize large quantities of viral proteins, and 
producing copious amounts of virus. 

The El region of the genome includes E1A and E1B, which encode proteins 
responsible for transcription regulation of the viral genome, as well as a few cellular 

20 genes. E2 expression, including E2A and E2B, allows synthesis of viral replicative 
functions, e.g. DNA-binding protein, DNA polymerase, and a terminal protein that 
primes replication. E3 gene products prevent cytolysis by cytotoxic T cells and tumor 
necrosis factor and appear to be important for viral propagation. Functions associated 
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with the E4 proteins include DNA replication, late gene expression, and host cell shutoff 
The late gene products include most of the virion capsid proteins, and these are expressed 
only after most of the processing of a single primary transcript from the major late 
promoter has occurred. The major late promoter (MLP) exhibits high efficiency during 
5 the late phase of the infection. 

As only a small portion of the viral genome appears to be required in cis, 
adenovirus-derived vectors offer excellent potential for the substitution of large DNA 
fragments when used in connection with cell lines such as 293 cells. Ad5-transformed 
human embryonic kidney cell lines have been developed to provide the essential viral 
10 proteins in trans. The inventors thus reasoned that the characteristics of adenoviruses 
rendered them good candidates for use in targeting Mathl deficient cells in vivo. In 
another embodiment these constructs include a Hathl or any afowaJ-associated nucleic 
acid sequence. 

Particular advantages of an adenovirus system for delivering foreign proteins to 
15 a cell include: (i) the ability to substitute relatively large pieces of viral DNA by foreign 
DNA; (ii) the structural stability of recombinant adenoviruses; (iii) the safety of 
adenoviral administration to humans; (iv) lack of any known association of adenoviral 
infection with cancer or malignancies; (v) the ability to obtain high titers of the 
recombinant virus; and (vi) the high infectivity of Adenovirus. 
20 One advantage of adenovirus vectors over retroviruses is a higher level of gene 

expression. Additionally, adenovirus replication is independent of host gene replication, 
unlike retroviral sequences. Because adenovirus transforming genes in the El region can 
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be readily deleted and still provide efficient expression vectors, oncogenic risk from 
adenovirus vectors is thought to be negligible. 

In general, adenovirus gene transfer systems are based upon recombinant, 
engineered adenovirus that is rendered replication-incompetent by deletion of a portion 
5 of its genome, such as El, and yet still retains its competency for infection. Relatively 
large foreign proteins can be expressed when additional deletions are made in the 
adenovirus genome. For example, adenoviruses deleted in both El and E3 regions are 
capable of carrying up to 10 Kb of foreign DNA and can be grown to high titers in 293. 
Surprisingly, persistent expression of transgenes following adenoviral infection is 
10 possible. Use of the adenovirus gene transfer system can be more useful for the delivery 
of Mathl to cells in nascent or damaged cartilage in joints. In particular, the Mathl 
adenovirus can be used to deliver Mathl and confer Mathl gene expression in, non- 
ossified joint cartilage that has been damaged as a consequence of osteoarthritis. 



15 EXAMPLE 15 

Afaf/ti-Adenovirus Constructs 

Recombinant virions for the controlled expression of Mathl can be constructed 
to exploit the advantages of adenoviral vectors, such as high titer, broad target range, 
efficient transduction, and non-integration in target cells for the transformation of cells 
20 into hair cells. In one embodiment these constructs include a Hathl or any atonal- 
associated nucleic acid sequence. In one embodiment of the invention, a replication- 
defective, helper-independent adenovirus is created that expresses wild type Mathl 
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sequences under the control of the human cytomegalovirus promoter or the 
metallothionine promoter. 

Control functions on expression vectors are often provided from viruses when 
expression is desired in mammalian ceils. For example, commonly used promoters are 

5 derived from polyoma, adenovirus 2 and simian virus 40 (SV40). The early and late 
promoters of S V40 virus are particularly useful because both are obtained easily from the 
virus as a fragment which also contains the S V40 viral origin of replication. Smaller or 
larger SV40 fragments canalso be used provided there is included the approximately 250 
bp sequence extending from the HindSJ site toward the BgR site located in the viral 

10 origin of replication. Further, it is also possible, and often desirable, to use promoter or 
control sequences normally associated with the Mathl gene sequence, namely the Mathl 
promoter, provided such control sequences are compatible with the host cell systems or 
the target cell. One such target cell is located in the inner ear of a human patient in need 
of inner ear hair cells. 

1 5 An origin of replication can be provided by construction of the vector to include 

an exogenous origin, such as can be derived from SV40 or other viral {e.g.* polyoma, 
adeno, VS V, BPV) source, or can be provided by the host cell chromosomal replication 
mechanism. If the vector is integrated into the host cell chromosome, the latter is often 
sufficient. 

20 

EXAMPLE 16 
^tonaf-Associated Nucleic Acid Delivery using Retrovirus 
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Another approach for gene delivery capitalizes on the natural ability of viruses 
to enter cells, bringing their own genetic material with them. Retroviruses have promise 
as gene delivery vectors due to their ability to integrate their genes into the host genome, 
transferring a large amount of foreign genetic material, infecting a broad spectrum of 
5 species and cell types and because they are easily packaged in special cell-lines. 
Retroviruses can be particularly useful for the delivery of Mathl into inner ear hair cells 
that have reduced expression of Mathl , or that are in need of over-expression of Mathl. 



10 EXAMPLE 17 

Mathl Retroviral Constructs 

The Mathl open reading frame (OKF) was excised from pBluescript by an EcdK 
l-Xbal digest. The fragment was gel purified, and blunt ended using Klenow DNA 
polymerase. The retroviral vector pLNCX (purchased from CLONTECH) was linearized 

1 5 with Hpal, and ligated with the Mathl ORF fragment. The ligation was transformed into 
transformation competent E. coli cells. The resulting antibiotic resistant colonies were 
assayed for the presence of the correct construct. 

The cloning, reproduction and propagation retroviral expression vectors is well 
known to those of skill in the art. One example of a retroviral gene transfer and 

20 expression system that has been used to express Mathl is the CLONTECH pLNCX, 
pLXSN and LAPSN expression vectors. For propagation of these vectors PT67 and 
EcoPack packaging cell lines can be used. For more information on mammalian cell 
culture, the following general references can be used: Culture of Animal Cells, Third 
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Edition, edition by R. I. Freshney (Wiley-Liss, 1993); and Current Protocols in 
Molecular Biology >, ed. By F. M. Ausubel, et aL, (Greene Publishing Associates and 
Wiley & Sons, 1994), relevant portions incorporated herein by reference. 

In another embodiment these constructs could be constructed with a Hathl or any 
5 atonal-associated nucleic acid sequence. 

EXAMPLE 18 
Maintenance of Packaging Cell Lines 
The maintenance of packaging cell lines, such as the 293 and PT67 packaging cell 
lines, is described briefly. A vial of frozen cells is transferred from liquid N 2 to a 37 °C 
10 water bath until just thawed. In order avoid osmotic shock to the cells, and to maximize 
cell survival, 1 ml of (Dulbecco's Modified Eagle Medium) DMEM is added to the tube 
and the mixture is transferred to a 15 -ml tube. Another 5 ml of DMEM is added and the 
cells are mixed. After repeating these steps the final volume in the tube should be about 
12 ml. Next, the cells are centrifuged at 500 x g for 1 0 min. Finally, the supernatant is 
15 removed and the cells are resuspended in maintenance media as described in the next 
step. Generally, the cells are maintained in DMEM (high glucose: 4.5 g/L) containing 
10% Fetal Bovine Serum (FBS), and 4 mM L-glutamine. If desired or necessary, 100 
U/ml penicillin/1 00 y,g/m\ streptomycin can be added. It is recommended that are plated 
at 3-5 x 10 5 per 100-mm plate and split every 2 to 3 days, when they reach 70-80% 
20 confluency (confluence is 3-4 x 1 0 6 per 1 00-mm plate). The PT67 cell line, for example, 
has a very short doubling time (<1 6h) and should be split before they become confluent. 
The doubling time for EcoPack-293 cells is 24-36 h. 
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Cells are split be removing the medium and washing the cells once with PBS. 
After treatment with 1-2 ml of trypsin-EDTA solution for 0.5-1 min, 5 to 10 ml of media 
and serum is added to stop trypsinization. The cells are dispersed gently, but thoroughly, 
by pipetting and are resuspended. Alternatively, a predetermined portion of the cells is 
5 replated in a 100-mm plate in 10 ml of medium, followed by rotation or shaking of the 
plate to distribute the cells evenly. A ratio of up to 1 :20 for the PT67 or EcoPack-293 
cells is common. 

Generally, the percentage of PT67 or EcoPack-293 cells capable of packaging 
retroviral vectors decreases slowly with continued passage of the cell line. Therefore, 
1 0 packaging cells should be reselected after 2 months of growth in culture. Alternatively, 
new high-titer cells can be purchased from, e.g., CLONTECH, or low passage number 
stocks can be frozen, stored and thawed to increase the viral yield. 



15 EXAMPLE 19 

Methods Utilizing a Retroviral Vector 

The following protocol is used to transfect the retroviral vector for virus 
production, infection of target cells, and selection of stable clones. Other methods and 
vectors canalso be used with the present invention to express Mathl, such as those 
20 described in Retroviruses, ed. by J. M. Coffin & EL E. Varmus (1996, Cold Spring 
Harbor Laboratory Press, NY) and Current Protocols in Molecular Biology, ed. by F. M. 
Ausubel et al (1994, Greene Publishing Associates and Wiley & Sons), incorporated 
herein by reference. 
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Briefly, the transfection of the retroviral vector into PT67 cells was as follows. 
Math 1 was cloned into pLNX as described hereinabove. The packaging cells were plated 
to a density of 5-7 x 10 5 cells per 100-mm plate 12-24 hours before transfection. 1-2 
hours before transfection, the medium replace with fresh medium. 25 /uM chloroquine 
5 can be added j ust prior to transfection. Chloroquine increases transfection efficiency 2-3 
fold. A 25 mM stock solution of chloroquine can be made in distilled water and filter 
sterilized. 

To each 100-mm plate 10-15 jug of plasmid DNA using the desired method is 
transfected using, e.g., standard calcium-phosphate procedures (CalPhos Mammalian 

1 0 Transfection Kit, #K2050- 1 ). The final volume of transfection mixture shouldnot exceed 
1 ml. The transfection solution is added to the medium and the plate is rotated to ensure 
even distribution. About 8 hours after transfection, a glycerol shock treatment can be 
performed to increase the uptake of DNA. After 10 to 24 hours post-transfection the 
medium was removed and the cells were washed twice with PBS, before adding 5 ml 

15 DMEM containing 10% FBS. The culture was incubated for an additional 12-48 hours 
to allow increase in virus titer. The virus titer reaches a maximum ~ 48 hours post- 
transfection and is generally at least 30% of maximum between 24 and 72 hours post- 
transfection. 

Alternatively, a stable virus-producing cell lines canalso be selected. To obtain 
20 stable virus-producing cell lines, the transfected packaging cells are plated in a selection 
medium 2-3 days post-transfection. For G41 8 selection of neomycin resistance, the cells 
are selected in the presence of G41 8 (0.5 mg/ml "active") for one week. Vectors carrying 
other selectable markers such as Puro, Bleo, or Hyg, can be used to obtain stable virus 
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producing cell populations as well . Cell populations producing virions that produce titers 
of 10 5 -10 6 recombinant virus particles per ml are common. Generally, 10 5 -10 6 
recombinant virus particles per ml is suitable for most purposes. For some studies, 
higher titer clones can be required. In this case, after antibiotic selection, individual 
5 clones are selected using, e.g., clone cylinders or limiting dilution, prior to propagation. 

Viral titer can be determined in a variety or ways, one such method is described 
hereinbelow. The viral titer produced by transiently transfected or stable virus-producing 
packaging cell lines is determined as follows, NIH/3T3 cells are plated one day prior to 
beginning the titer procedure. Cells are plated in 6-well plates at a density of 5 x 10M 

10 x 10 5 cells per well and 4 ml of media are added per well. Virus-containing medium is 
collected from packaging cells, and polybrene is added to a final concentration of 4 
yug/ml. The medium is filter-sterilized through a OAS-jxm filter. Polybrene is a 
polycation that reduces the charge repulsion between the virus and the cellular 
membrane. The filter should be cellulose acetate or polysulfonic (low protein binding) 

15 but not nitrocellulose. Nitrocellulose binds proteins in the retroviral membrane, and 
consequently destroys the virus. Serial dilutions are prepared as follows: six 10-fold 
serial dilutions are usually sufficient. To dilute the virus use fresh medium containing 
4 jug/ml of polybrene. Next, NIH/3T3 target cells are infected by adding virus-containing 
medium to the wells. After 48 hours, the NIH/3T3 cells are stained. The titer of virus 

20 corresponds to the number of colonies present at the highest dilution that contains 
colonies, multiplied by the dilution factor. For example, the presence of four colonies 
in the 10 5 dilution would represent a viral titer of 4 x 10 5 . 
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For the infection of cells, the following procedure was followed. The target cells 
were plated 12-18 hours before infection at a cell density of 3-5 x 10 5 per 100-mm plate. 
For the infection of cells that can be used for a biological assay, control cells can be 
treated with an insert-free virus produced under identical conditions. Half-maximal 
5 infection generally occurs after 5-6 hours of exposure of cells to virus, with maximal 
infection occurring after approximately 24 hours of exposure. The actual reverse 
transcription and integration of the retrovirus takes place within 24-3 6 hours of infection, 
depending on cell growth kinetics. Expression can be observed at 24 hours, and reaches 
a maximum at approximately 48 hours. Alternatively, infections can be conducted 
1 0 sequentially, about 1 2 hours apart. Sequential infection generally increases the efficiency 
of infection and also increases viral copy number. A minimum of 1 2 hours between each 
infection is recommended in order to ensure that cellular receptors will be unoccupied 
by viral envelope. 



15 EXAMPLE 20 

Screening Assays 

Finally, the present invention also provides candidate substance screening 
methods that are based upon whole cell assays, in vivo analysis and transformed or 
immortal cell lines in which a reporter gene is employed to confer on its recombinant 
20 hosts a readily detectable phenotype that emerges only under conditions where Mathl 
would be expressed, is under-expressed or is over-expressed. Generally, reporter genes 
encode a polypeptide not otherwise produced by the host cell that is detectable by 
analysis, e.g., by chromogenic, fluorometric, radioisotopic or spectrophotometric 
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analysis. In the present invention the Mathl gene has been replaced with P-galactosidase 
in a mouse. 

An example of a screening assay of the present invention is presented herein. 
Mathl expressing cells are grown in microtiter wells, followed by addition of serial molar 
proportions of the small molecule candidate to a series of wells, and determination of the 
signal level after an incubation period that is sufficient to demonstrate, e.g., calretinin 
expression in controls incubated solely with the vehicle used to resuspend or dissolve the 
compound. The wells containing varying proportions of candidate are then evaluated for 
signal activation. Candidates that demonstrate dose related enhancement of reporter gene 
transcription or expression are then selected for further evaluation as clinical therapeutic 
agents. The stimulation of transcription can be observed in the absence of expressed 
Mathl , in which case the candidate compound might be a positive stimulator of hair cell 
differentiation. Alternatively, the candidate compound might only give a stimulation in 
the presence of low levels of Mathl, which would suggest that it functions to stabilize 
the formation of Mathl dimers or the interaction of Mathl with one or more 
transcriptional factors. Candidate compounds of either class might be useful therapeutic 
agents that would stimulate production of inner ear hair cells and thereby address the 
need of patients with hearing loss or balance control impairments. 

EXAMPLE 21 
Transfection of Cells with Mathl Retroviral Vectors 

The present invention provides recombinant host cells transformed or transfected 
with a polynucleotide that encodes Mathl, as well as transgenic cells derived from those 
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transformed or transfected cells. In another embodiment these constructs could be 
constructed with a Hath J or any a£cwa/-associated nucleic acid sequence. A recombinant 
host cell of the present invention is transfected with a polynucleotide containing a 
functional Mathl nucleic acid sequence or a chimeric Maihl gene. Methods of 
5 transforming or transfecting cells with exogenous polynucleotides, such as DNA 
molecules, are well known in the art and include techniques such as calcium-phosphate- 
or DEAE-dextran-mediated transfection, protoplast fusion, electroporation, liposome 
mediated transfection, direct microinjection and adenovirus infection. 

Mathl expression using recombinant constructs can be used to target the delivery 

1 0 of Math J to cells in need thereof. Different promoter- vector combinations can be chosen 
by a person skilled in these arts to drive Math 1 expression in different cell types. In some 
cases, the desired outcome cannot be protein, but RNA, and recombinant vectors would 
include those with inserts present in either forward or reverse orientations. In addition, 
some vectors, for instance retroviruses or artificial recombination systems, can be 

15 designed to incorporate sequences within a cellular or viral genome in order to achieve 
constitutive or inducible expression of protein or RNA. 

Many of the vectors and hosts are available commercially and have specific 
features that facilitate expression or subsequent purification. For instance DNA 
sequences to be expressed as proteins often appear as fusion with unrelated sequences 

20 that encode polyhistidine tags, or HA, FLAG, myc and other epitope tags for 
immunochemical purification and detection, or phosphorylation sites, or protease 
recognition sites, or additional protein domains such as glutathione S-transferase (GST), 
maltose binding protein (MBP) (New England Biolabs), and so forth that facilitate 
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purification. Vectors canalso be designed that contain elements for polyadenylation, 
splicing, and termination, such that incorporation of naturally occurring genomic DNA 
sequences that contain introns and exons can be produced and processed, or such that 
unrelated introns and other regulatory signals require RNA processing prior to production 
5 of mature, translatable RNAs. Proteins produced in the systems described above are 
subject to a variety of post-translational modifications, such as glycosylation, 
phosphorylation, nonspecific or specific proteolysis or processing. 



EXAMPLE 22 

10 Delivery of Mathl as an Amino Acid Sequence 

A peptide (1 1 amino acids) derived from HIV has been recently described that 
when fused to full length proteins and injected into mice allow a rapid dispersal to the 
nucleus of all cells of the body (Schwarze et al., 1999). Schwarze et ah made fusion 
proteins to Tat ranging in size from 15 to 120 kDa. They documented a rapid uptake of 

1 5 the fusion proteins to the nuclei of cells throughout the animal, and the functional activity 
of said proteins was retained. 

In an embodiment of the present invention there are constructs containing the Tat 
or Tat-HA nucleic acid sequence operatively linked to a Mathl nucleic acid sequence. 
In another embodiment these constructs include a Hathl or any atonal-associated nucleic 

20 acid sequence. The vectors are expressed in bacterial cultures and the fusion protein is 
purified. This purified Tat-Mathl protein or Tat-Hathl protein is injected into animal to 
determine the efficiency of the Tat delivery system into the inner ear, skin, cerebellum, 
brain stem, spinal cord and joints. Analysis is carried out to determine the potential of 
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the Tat-Mathl/Tat-Hathl protein in hair cell and neuronal regeneration. This is a viable 
therapeutic approach either in its own right or in association with other methods or genes. 

It should be understood that the methods to screen for compounds which affect 
Maihl expression disclosed herein are useful notwithstanding that effective candidates 
5 cannot be found, since it is of practical utility to know what upstream effector is 
necessary for Maihl transcription. 
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What is claimed is: 

1 . An animal having a heterologous nucleic acid sequence replacing an allele of an 
atonaZ-associated nucleic acid sequence under conditions wherein said 

5 heterologous sequence inactivates said allele. 

2. The animal of claim 1 wherein said heterologous nucleic acid sequence is 
expressed under the control of an afowaZ-associated regulatory sequence. 

3. The animal of claim 1 wherein both atonaZ-associated alleles are replaced. 

4. The animal of claim 3 wherein said heterologous nucleic acid sequences are 
10 nonidentical. 

5. The animal of claim 3 wherein said animal has a detectable condition. 

6. The animal of claim 5 wherein said detectable condition is selected from the 
group consisting of loss of hair cells, cerebellar granule neuron deficiencies, 
hearing impairment, an imbalance disorder, joint disease, osteoarthritis and 

1 5 abnormal proliferation of cells. 

7. The animal of claim 1 or 3 wherein said heterologous nucleic acid sequence is a 
reporter sequence. 

8. The animal of claim 1 or 3, wherein said a/onaZ-associated allele is replaced with 
an a/onaZ-associated nucleic acid sequence under the control of regulatable 

20 promoter sequence. 

9. The animal of claim 1 or 3, wherein said atonaZ-associated allele is replaced with 
an atowaZ-associated nucleic acid sequence under the control of a tissue-specific 
promoter sequence. 
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10. The animal of claim 1 or 3 wherein said animal is selected from the group 
consisting of a mouse, Drosophila, zebrafish, frog, rat, hamster and guinea pig. 

11. A method for screening for a compound in an animal wherein said compound 
affects expression of an atonal-associated nucleic acid sequence comprising: 

delivering said compound to said animal, wherein said animal has at least 
one allele of an atona/-associated nucleic acid sequence inactivated by insertion of a 
heterologous nucleic acid sequence, wherein said heterologous nucleic acid 
sequence is under control of an atonal-associated regulatory sequence; and 

monitoring for a change in said expression of said atonaZ-associated 
nucleic acid sequence. 

12. The method of claim 1 1 wherein said compound affects expression of an atonal- 
associated nucleic acid sequence. 

1 3 . The method of claim 1 1 wherein said compound affects a detectable condition in 
an animal. 

14. The method of claim 11, wherein said heterologous nucleic acid sequence is a 
reporter sequence. 

15. A method for screening for a compound in an animal, wherein said compound 
affects a detectable condition in said animal, comprising: 

delivering said compound to said animal wherein at least one allele of an 
afonaZ-associated nucleic acid sequence in said animal is inactivated by insertion of a 
heterologous nucleic acid sequence, wherein said heterologous nucleic acid sequence is 
under the control of an afowaZ-associated regulatory sequence, and 
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monitoring said animal for a change in the detectable condition. 

16. The method of claim 11 or 15 wherein said compound affects said detectable 
condition. 

17. The method of claim 15 wherein said compound affects expression of said 
5 heterologous nucleic acid sequence. 

18. A method of treating an animal with a deficiency in cerebellar granule neurons 
or their precursors comprising delivery of a therapeutically effective amount of 
an afonaZ-associated amino acid sequence or nucleic acid sequence to a cell of 
said animal. 

10 19. A method of promoting mechanoreceptive cell growth in an animal, comprising 
delivering a therapeutically effective amount of an atonal-associated amino acid 
sequence or nucleic acid sequence to a cell of said animal. 

20. A method of generating hair cells comprising delivering a therapeutically 
effective amount of an aforcaZ-associated amino acid sequence or nucleic acid 

1 5 sequence to a cell of said animal. 

21 . A method of treating an animal for hearing impairment, comprising delivering a 
therapeutically effective amount of an atonal-associated amino acid sequence or 
nucleic acid sequence to a cell of said animal. 

22. A method of treating an animal for an imbalance disorder, comprising delivering 
20 a therapeutically effective amount of an aZowaZ-associated amino acid sequence 

or nucleic acid sequence to a cell of said animal. 
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23. A method of treating an animal for a joint disease comprising delivering a 
therapeutically effective amount of an ato?za/-associated amino acid sequence or 
nucleic acid sequence to a cell of said animal. 

24. A method of treating an animal for an abnormal proliferation of cells comprising 
5 delivering a therapeutically effective amount of an ato/zaZ-associated amino acid 

sequence or nucleic acid sequence to a cell of said animal. 

25 . A method of treating an animal for an abnormal proliferation of cells comprising 
altering atona/-associated nucleic acid sequence or amino acid sequence levels 
in a cell. 

10 26. A method of treating an animal for a disease that is a result of loss of functional 
flto«aZ-associated nucleic acid or amino acid sequence comprising delivering a 
therapeutically effective amount of an atowaZ-associated amino acid sequence or 
nucleic acid sequence to a cell of said animal. 

27. The method of claim 1 8, 1 9, 20, 2 1 , 22, 23, 24, 25 or 26, wherein said amino acid 
1 5 sequence or nucleic acid sequence is delivered by a delivery vehicle. 

28. The method of claim 1 8, 1 9, 20, 21 , 22, 23, 24, 25 or 26, wherein said amino acid 
sequence or nucleic acid sequence is delivered by a delivery vehicle, and wherein 
said delivery vehicle is selected from the group consisting of an adenoviral 
vector, a retroviral vector, an adeno-associated viral vector, a plasmid, a 

20 liposome, a nucleic acid, a peptide, a lipid, a carbohydrate and a combination 

thereof. 

29. The method of claim 18, 19,20,21,22,23,24,25 or 26, wherein said amino acid 
sequence or nucleic acid sequence is delivered by a delivery vehicle, wherein said 
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delivery vehicle is selected from the group consisting of a viral vector or a non- 
viral vector. 

30. The method of claim 1 8, 19, 20, 2 1 , 22, 23, 24, 25 or 26, wherein said amino acid 
sequence or nucleic acid sequence is delivered by a delivery vehicle, and wherein 

5 said delivery vehicle is a cell. 

31. The method of claim 18, 19, 20, 21, 22, 23, 24, 25 or 26, wherein said atonal- 
associated amino acid sequence or nucleic acid sequence is Mathl. 

32. The method of claim 18, 19, 20, 21, 22, 23, 24, 25 or 26, wherein said atonal- 
associated amino acid sequence or nucleic acid sequence is Hathl . 

10 33. The method of claim 18, 19, 20, 21, 22, 23, 24, 25 or 26, wherein said cell 
contains an alteration in an afowa/-associated nucleic acid sequence or amino acid 
sequence, 

34. The method of claim 1 8, 1 9, 20, 2 1 , 22, 23, 24, 25 or 26, wherein said amino acid 
sequence has at least about 80% identity to about 20 contiguous amino acid 

15 residues of SEQ ID NO:58 (Hathl). 

35. The method of claim 18, 19, 20, 21, 22, 23, 24, 25 or 26, wherein said nucleic 
acid sequence encodes a polypeptide which has at least about 80% identity to 
about 20 contiguous amino acid residues of SEQ ID NO:58 (Hathl). 

36. The method of claim 18, 19, 20, 21, 22, 23, 24, 25 or 26, wherein said cell is a 
20 human cell. 

37. The method of claim 20 or 21 , wherein said delivery comprises injecting into an 
inner ear a therapeutically effective amount of an <zto/z<2/-associated amino acid 
sequence or nucleic acid sequence. 
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38. The method of claim 23 wherein said joint disease is osteoarthritis. 

39. The method of claim 24 or 25, wherein said cell is a cancer cell. 

40. A composition comprising an atonaZ-associated amino acid sequence or nucleic 
acid sequence in combination with a delivery vehicle, wherein said delivery 

5 vehicle results in delivery of a therapeutically effective amount of atonal- 

associated nucleic acid sequence or amino acid sequence into a cell. 
4L The composition of claim 40, wherein said delivery vehicle comprises a vector 
that expresses an atowaZ-associated nucleic acid sequence or amino acid sequence 
in an animal cell. 

10 42. The composition of claim 41, wherein said vector is selected from the group 
consisting of an adenoviral vector, aretroviral vector, an adeno-associated vector, 
a plasmid, a liposome, a protein, a lipid, a carbohydrate and a combination 
thereof of said vehicles. 

43. The composition of claim 41, wherein said vector is selected from the group 
1 5 consisting of a viral vector or a non-viral vector. 

44. The composition of claim 40, wherein said delivery vehicle is the receptor- 
binding domain of a bacterial toxin. 

45. The composition of claim 40, wherein said atoMaZ-associated nucleic acid 
sequence is operatively linked to nucleic acid sequence encoding a receptor- 

20 binding domain of a bacterial toxin. 

46. The composition of claim 40 wherein said aforcaZ-associated nucleic acid 
sequence is operatively linked to nucleic acid sequence encoding a protein 
transduction domain. 
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DECLARATION — Utility or Design Patent Application 



I v I Customer Number \ 
Direct all correspondence to: | * | 0 r Bar Code Label 


26,271 \) 

^ y 


OR | | Correspondence address below 


Melissa L. Sistrunk 
Name Fulbright & Jaworski LLP 



1301 McKinney, Suite 5100 

Address 



City 



Houston 



State 



Texas 



Country 



USA 



Telephone 71 3.651 .3735 



ZIP 77010-3095 



Fax 



713.651.5246 



I hereby declare that all statements made herein of my own knowledge are true and that ail statements made on information and 
belief are believed to be true; and further that these statements were made with the knowledge that willful fa se statements and 
the like so made are punishable by fine or imprisonment, or both, under 18 U.S.C. 1001 and that such willful false statements 
may jeopardize the validity of the application or any patent issued thereon, ^ 



NAME OF SOLE OR FIRST INVENTOR: 



Given Name 

(first and middle [if any]) 



Inventor's 
Signature 



Residence! 




| | A petition has been filed for this unsigned inventor 

Zoghbi 



Family Name 
or Surname 



USA 



Date 



US 



Citizenship 



Mailing 
Address: 



6618 Sewanee 



citv 


Houston 


TX 
State 


77005 

ZIP 


USA 

Country 


NAME OF SECOND INVENTOR: 


| j a petition has been filed for this unsigned inventor 


Given Name 
(first and middle [if 




Family Name Bellen 
or Surname 


Inventor's 
Signature 1 


J^gJ/T/^ 


Date 2 I HkoOX 


Residence: City 


ij^n^-^ 




^Coim^y 


USA 

Citizenship 


Mailing 
Address: 


. . ■ i 

341 1 Mt. Vernon 


City 


Houston 


TX 

State 


77006 

ZIP 


USA 

Country 



xl Additional inventors are being named on the 1 supplemental Additional Inventor(s) sheet(s) PTO/SB/02A attached hereto 
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DECLARATION 



ADDITIONAL INVENTOR (S) 
Supplemental Sheet 

i Papal of 1 



Name of Additional Joint Inventor, H any: 



|~j A patrtiort has twtrt Wed tor this unstar*** inventor 



O vert N«rrt* 

Uffrirt and middle g»af!%3) 



2 




Nessan 



Bermingham 



jjeaT" 

Gauntry t»£. 




Citi zenship 



^n!^t(eT1^| 

Adctms: 



st»t* 



Marrte of Additf ona* Joint Inventor, & any: 



ZIP 



Country j^/^ U-^^? 



a peMpn has been We<J for fots ur&ig/ied Inventor 



Giver* Nim« 

jrr_« MML*L#Wa tit a null 


Bassem 




Family Name 
or Surname 


Hassan 


Inventor's 


Dita 


tlAfttdAnee: Cltv 




USA 

Country 


USA 

Citizenship 


HalKnfi 






Ctty 




Slat* 


ZIP 


USA 

Country 


Namft Of Additional Joint Invents 


►r, If any; 


| [ A petition has been fited for tfili unsigned irwemor 


Given Nume 

r« u * J «U>U. t(# anvil 


N 


issim 




j F*ntfyName 
] or Surname 


Ben-ari© 


inventor's 












Date 


Ntevaseret Zion 

R*»lct«r«:et City 


State 


Israel 

Country 


Israel 

Citizenship 



Mailing 
Address: 



46 Hachoshen 8ivU 
Mevaseret Zton, 90805 ISRAEL 



Mevaseret Zion 



City 



90805 



state 



Israel 



Country 
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DECLARATION 



ADDITIONAL tNVENTOR(S) 
Supplemental Sheet 

_ Pgqe 1 of 1 




am* of Additional Joint Inventor, If any: 



j~" ] A petition has be#n Wed for this unsigned (nventor 



liven Nam* 
■ftlnyt and frtiddlo fTf any)) 



Nessan 



Family Nam© 
or Surname 



Bermingham 



Inventor'* 
Signature 




Co untry 



USA 



Pat* 



USA 



ZIP 



Country 



USA 



j j A petition has been filed for tnls unsigned investor 



FamNy Name 
or S vrnarofc 



Hassan 



DM* 



USA 



C ountry 



Citizenship 



USA 



ZIP 



Country 



USA 



j j A petition has been fifed for mis unsigned inventor 



Family Nam* 
or Surname 



Bervarte 



Inventor's 
Slpnatun* 



jMgyageret Zton 

ftesltf*ne*T£Ety ■ — 



Moiling 

AdCfTAM; 



State 



Israel 



Country 



Date 



Israel 



46 Hachoshen Blv<± 
Mevaseret 2ion, 90805 ISRAEL 



City 



Mevaseret Zion 



Swte 



90805 



ZIP 



Country 



Israel 



XVH 9fi'TT 30M 20 20/90 
3003/9 /3 :p©ATeoeu 



SEQUENCE LISTING 

<110> Zoghbi, Huda 

Ben-Arie, Nissim 
Bellen, Hugo 
Birmingham, Nessan 
Hassan, Bassem 

<120> COMPOSITIONS AND METHODS FOR THE THERAPEUTIC USE OF AN ATONAL- 
ASSOCIATED SEQUENCE FOR DEAFNESS, OSTEOARTHRITIS, AND ABNORMAL CELL 
PROLIFERATION 

<130> HO-P01899US3/09906355/OTA 99-47 

<140> US 09/980,381 
<141> 2001-11-30 

<150> US 60/137,060 
<151> 1999-06-01 

<150> US 60/176,993 
<151> 2000-01-19 

<150> PCT/US00/15410 
<151> 2000-06-01 

<160> 69 

<170> Patentln version 3.0 

<210> 1 

<211> 1065 

<212> DNA 

<213> HUMAN 

<400> 1 



atgtcccgcc 


tgctgcatgc 


agaagagtgg 


gctgaagtga 


aggagttggg 


agaccaccat 


60 


cgccagcccc 


agccgcatca 


tctcccgcaa 


ccgccgccgc 


cgccgcagcc 


acctgcaact 


120 


ttgcaggcga 


gagagcatcc 


cgtctacccg 


cctgagctgt 


ccctcctgga 


cagcaccgac 


180 


ccacgcgcct 


ggctggctcc 


cactttgcag 


ggcatctgca 


cggcacgcgc 


cgcccagtat 


240 


ttgctacatt 


ccccggagct 


gggtgcctca 


gaggccgctg 


cgccccggga 


cgaggtggac 


300 


ggccgggggg 


agctggtaag 


gaggagcagc 


ggcggtgcca 


gcagcagcaa 


gagccccggg 


360 


ccggtgaaag 


tgcgggaaca 


gctgtgcaag 


ctgaaaggcg 


gggtggtggt 


agacgagctg 


420 


ggctgcagcc 


gccaacgggc 


cccttccagc 


aaacaggtga 


atggggtgca 


gaagcagaga 


480 


cggctagcag 


ccaacgccag 


ggagcggcgc 


aggatgcatg 


ggctgaacca 


cgccttcgac 


540 


cagctgcgca 


atgttatccc 


gtcgttcaac 


aacgacaaga 


agctgtccaa 


atatgagacc 


*600 


ctgcagatgg 


cccaaatcta 


catcaacgcc 


ttgtccgagc 


tgctacaaac 


gcccagcgga 


660 


ggggaacagc 


caccgccgcc 


tccagcctcc 


tgcaaaagcg 


accaccacca 


ccttcgcacc 


720 


gcggcctcct 


atgaaggggg 


cgcgggcaac 


gcgaccgcag 


ctggggctca 


gcaggcttcc 


780 



ggagggagcc agcggccgac cccgcccggg agttgccgga ctcgcttctc agccccagct 840 

tctgcgggag ggtactcggt gcagctggac gctctgcact tctcgacttt cgaggacagc 900 

gccctgacag cgatgatggc gcaaaagaat ttgtctcctt ctctccccgg gagcatcttg 960 

cagccagtgc aggaggaaaa cagcaaaact tcgcctcggt cccacagaag cgacggggaa 1020 

ttttcccccc attcccatta cagtgactcg gatgaggcaa gttag 1065 



<210> 2 

<211> 354 

<212> PRT 

<213> HUMAN 

<400> 2 

Met Ser Arg Leu Leu His Ala Glu Glu Trp Ala Glu Val Lys Glu Leu 
15 10 15 

Gly Asp His His Arg Gin Pro Gin Pro His His Leu Pro Gin Pro Pro 
20 25 30 

Pro Pro Pro Gin Pro Pro Ala Thr Leu Gin Ala Arg Glu His Pro Val 
35 40 45 

Tyr Pro Pro Glu Leu Ser Leu Leu Asp Ser Thr Asp Pro Arg Ala Trp 
50 55 60 

Leu Ala Pro Thr Leu Gin Gly lie Cys Thr Ala Arg Ala Ala Gin Tyr 
65 70 75 80 

Leu Leu His Ser Pro Glu Leu Gly Ala Ser Glu Ala Ala Ala Pro Arg 
85 90 95 

Asp Glu Val Asp Gly Arg Gly Glu Leu Val Arg Arg Ser Ser Gly Gly 
100 105 110 

Ala Ser Ser Ser Lys Ser Pro Gly Pro Val Lys Val Arg Glu Gin Leu 
115 120 125 

Cys Lys Leu Lys Gly Gly Val Val Val Asp Glu Leu Gly Cys Ser Arg 
130 135 140 

Gin Arg Ala Pro Ser Ser Lys Gin Val Asn Gly Val Gin Lys Gin Arg 
145 150 155 160 

Arg Leu Ala Ala Asn Ala Arg Glu Arg Arg Arg Met His Gly Leu Asn 
165 170 175 

His Ala Phe Asp Gin Leu Arg Asn Val lie Pro Ser Phe Asn Asn Asp 
180 185 190 

Lys Lys Leu Ser Lys Tyr Glu Thr Leu Gin Met Ala Gin lie Tyr lie 
195 200 205 

Asn Ala Leu Ser Glu Leu Leu Gin Thr Pro Ser Gly Gly Glu Gin Pro 
210 215 220 



Pro Pro Pro Pro Ala Ser Cys Lys Ser Asp His His His Leu Arg Thr 
225 230 235 240 

Ala Ala Ser Tyr Glu Gly Gly Ala Gly Asn Ala Thr Ala Ala Gly Ala 
245 250 255 

Gin Gin Ala Ser Gly Gly Ser Gin Arg Pro Thr Pro Pro Gly Ser Cys 
260 265 270 

Arg Thr Arg Phe Ser Ala Pro Ala Ser Ala Gly Gly Tyr Ser Val Gin 
275 280 285 

Leu Asp Ala Leu His Phe Ser Thr Phe Glu Asp Ser Ala Leu Thr Ala 
290 295 300 

Met Met Ala Gin Lys Asn Leu Ser Pro Ser Leu Pro Gly Ser lie Leu 
305 310 315 320 

Gin Pro Val Gin Glu Glu Asn Ser Lys Thr Ser Pro Arg Ser His Arg 
325 330 335 

Ser Asp Gly Glu Phe Ser Pro His Ser His Tyr Ser Asp Ser Asp Glu 
340 345 350 

Ala Ser 



<210> 3 

<211> 420 

<212> DNA 

<213> MOUSE 

<400> 3 

tcgtcccacg cgtccgcgcc aagcccgcgg cgcggaggac accgtgctcg gttccgggct 60 

gcggggacat tcccggacac acaccggagc agcagctgcg ccgcgacaca tctggagccg 120 

cgtaggatgt tcgtcaaatc tgagactctg gagttgaagg aggaagagga ggtactgatg 180 

ctgctgggct cggcttcccc ggcctcggcg accctgaccc cgatgtcctc cagcgcggac 240 

gaggaggagg acgaggagct gcgccggccg ggctccgcgc gtgggcagcg tggagcggaa 300 

gccgggcagg gggtgcaggg cagtccggcg tcgggtgcct ggggttgccg gacagggcgg 360 

ctgctatgca ctgtgcacga gtgctagcgt gtgccgtcgc gctcacgggc cgtctgcaga 420 

<210> 4 

<211> 645 

<212> DNA 

<213> MOUSE 

<400> 4 

atggcgcctc atcccttgga tgcgctcacc atccaagtgt ccccagagac acaacaacct 60 

tttcccggag cctcggacca cgaagtgctc agttccaatt ccaccccacc tagccccact 120 

ctcataccta gggactgctc cgaagcagaa gtgggtgact gccgagggac ctcgaggaag 180 

ctccgcgccc gacgcggagg gcgcaacagg cccaagagcg agttggcact cagcaaacag 24 0 



cgaagaagcc ggcgcaagaa ggccaatgat cgggagcgca atcgcatgca caacctcaac 300 

tcggcgctgg atgcgctgcg cggtgtcctg cccaccttcc cggatgacgc caaacttaca 360 

aagatcgaga ccctgcgctt cgcccacaac tacatctggg cactgactca gacgctgcgc 420 

atagcggacc acagcttcta tggcccggag ccccctgtgc cctgtggaga gctggggagc 4 80 

cccggaggtg gctccaacgg ggactggggc tctatctact ccccagtctc ccaagcgggt 54 0 

aacctgagcc ccacggcctc attggaggaa ttccctggcc tgcaggtgcc cagctcccca 600 

tcctatctgc tcccgggagc actggtgttc tcagacttct tgtga 645 



<210> 5 

<211> 214 

<212> PRT 

<213> MOUSE 

<400> 5 

Met Ala Pro His Pro Leu Asp Ala Leu Thr lie Gin Val Ser Pro Glu 
15 10 15 

Thr Gin Gin Pro Phe Pro Gly Ala Ser Asp His Glu Val Leu Ser Ser 
20 25 30 

Asn Ser Thr Pro Pro Ser Pro Thr Leu lie Pro Arg Asp Cys Ser Glu 
35 40 45 

Ala Glu Val Gly Asp Cys Arg Gly Thr Ser Arg Lys Leu Arg Ala Arg 
50 55 60 

Arg Gly Gly Arg Asn Arg Pro Lys Ser Glu Leu Ala Leu Ser Lys Gin 
65 70 75 80 

Arg Arg Ser Arg Arg Lys Lys Ala Asn Asp Arg Glu Arg Asn Arg Met 
85 90 95 

His Asn Leu Asn Ser Ala Leu Asp Ala Leu Arg Gly Val Leu Pro Thr 
100 105 110 

Phe Pro Asp Asp Ala Lys Leu Thr Lys lie Glu Thr Leu Arg Phe Ala 
115 120 125 

His Asn Tyr lie Trp Ala Leu Thr Gin Thr Leu Arg lie Ala Asp His 
130 135 140 

Ser Phe Tyr Gly Pro Glu Pro Pro Val Pro Cys Gly Glu Leu Gly Ser 
145 150 155 160 

Pro Gly Gly Gly Ser Asn Gly Asp Trp Gly Ser lie Tyr Ser Pro Val 
165 170 175 

Ser Gin Ala Gly Asn Leu Ser Pro Thr Ala Ser Leu Glu Glu Phe Pro 
180 185 190 

Gly Leu Gin Val Pro Ser Ser Pro Ser Tyr Leu Leu Pro Gly Ala Leu 
195 200 205 



Val Phe Ser Asp Phe Leu 
210 

<210> 6 

<211> 1412 

<212> DNA 

<213> MOUSE 

<400> 6 



cgtgctcggt 


tccgggctgc 


ggggacattc 


ccggacacac 


accggagcag 


cagctgcgcc 


60 


gcgacacatc 


tggagccgcg 


taggatgttc 


gtcaaatctg 


agactctgga 


gttgaaggag 


120 


gaagaggagg 


tactgatgct 


gctgggctcg 


gcttccccgg 


cctcggcgac 


cctgaccccg 


180 


atgtcctcca 


gcgcggacga 


ggaggaggac 


gaggagctgc 


gccggccggg 


ctccgcgcgt 


240 


gggcagcgtg 


gagcggaagc 


cgggcagggg 


gtgcagggca 


gtccggcgtc 


gggtgccggg 


300 


ggttgccggc 


cagggcggct 


gctgggcctg 


atgcacgagt 


gcaagcgtcg 


cccgtcgcgc 


360 


tcacgggccg 


tctcccgagg 


tgccaagacg 


gcggagacgg 


tgcagcgcat 


caagaagacc 


420 


cgcaggctca 


aggccaacaa 


ccgcgagcgc 


aaccgcatgc 


acaacctaaa 


cgccgcgctg 


480 


gacgcgctgc 


gcgaggtgct 


gcccaccttc 


cccgaggatg 


ccaagctcac 


gaagatcgag 


540 


acgctgcgct 


tcgcccacaa 


ttacatctgg 


gcgctcaccg 


agactctgcg 


cctggcggac 


600 


cactgcgccg 


gcgccggtgg 


cctccagggg 


gcgctcttca 


cggaggcggt 


gctcctgagc 


660 


ccgggagctg 


cgctcggcgc 


cagcggggac 


agcccttctc 


caccttcctc 


ctggagctgc 


720 


accaacagcc 


cggcgtcatc 


ctccaactcc 


acgtccccat 


acagctgcac 


tttatcgccc 


780 


gctagccccg 


ggtcagacgt 


ggactactgg 


cagcccccac 


ctccggagaa 


gcatcgttat 


840 


gcgcctcacc 


tgcccctcgc 


cagggactgt 


atctagagct 


gcgggtctcc 


ctctctcgtc 


900 


ct ctacccgg 


ccct ctt ccc 


at cctt ct cc 


cgcccct cac 


cct ccacgcc 


ccggact cca 


you 


cttcacagag 


cagaggtggc 


ccttgcaatc 


ccctcggcgg 


ctggtgcatt 


cgggggtgga 


1020 


gaccagctct 


ggtttattga 


agatgtgagg 


atttatggtc 


aaagaggact 


atggcgtgtg 


1080 


ggagtggggg 


ctggcgtggg 


gaacctcgta 


agactgtaaa 


agacactgag 


aaaaagtacc 


1140 


ataactaacg 


agtgtgcaga 


gcagactgac 


gctcctcccc 


tctctcagag 


ctgctggagg 


1200 


agaactccgg 


gcaggcagtt 


cgtgtgaatc 


tctcagaggg 


aatgcaactg 


gtccctgtga 


1260 


tcttttcacc 


ttcgtttcta 


catagagatg 


ttaatgtcag 


tcgaaagaaa 


tgtattttag 


1320 


catctgaatg 


aatttactgg 


taataatatt 


atccacacat 


ttgcaatggc 


tggcatctgc 


1380 


tctattccca 


ttgctgtctg 


caggctgtgg 


ga 






1412 



<210> 7 
<211> 263 



<212> PRT 
<213> MOUSE 



<400> 7 

Met Phe Val Lys 

1 

Leu Met Leu Leu 
20 

Met Ser Ser Ser 
35 

Gly Ser Ala Arg 
50 

Gly Ser Pro Ala 
65 

Gly Leu Met His 



Ser Arg Gly Ala 
100 

Arg Arg Leu Lys 
115 

Asn Ala Ala Leu 
130 

Asp Ala Lys Leu 
145 

lie Trp Ala Leu 



Ala Gly Gly Leu 
180 

Pro Gly Ala Ala 
195 

Ser Trp Ser Cys 
210 

Pro Tyr Ser Cys 
225 

Tyr Trp Gin Pro 



Pro Leu Ala Arg 
260 

<210> 8 
<211> 1957 
<212> DNA 
<213> MOUSE 



Ser Glu 
5 

Gly Ser 

Ala Asp 

Gly Gin 

Ser Gly 
70 

Glu Cys 
85 

Lys Thr 



Ala Asn 



Asp Ala 



Thr Lys 
150 

Thr Glu 
165 

Gin Gly 



Leu Gly 



Thr Asn 



Thr Leu 
230 

Pro Pro 
245 

Asp Cys 



Thr Leu 

Ala Ser 

Glu Glu 
40 

Arg Gly 
55 

Ala Gly 

Lys Arg 

Ala Glu 

Asn Arg 
120 

Leu Arg 
135 

lie Glu 

Thr Leu 

Ala Leu 

Ala Ser 
200 

Ser Pro 
215 

Ser Pro 
Pro Glu 
lie 



Glu Leu 
10 

Pro Ala 
25 

Glu Asp 

Ala Glu 

Gly Cys 

Arg Pro 
90 

Thr Val 
105 

Glu Arg 
Glu Val 



Thr Leu 

Arg Leu 

170 

Phe Thr 
185 

Gly Asp 

Ala Ser 

Ala Ser 

Lys His 
250 



Lys Glu 

Ser Ala 

Glu Glu 

Ala Gly 
60 

Arg Pro 
75 

Ser Arg 

Gin Arg 

Asn Arg 

Leu Pro 
140 

Arg Phe 
155 

Ala Asp 

Glu Ala 

Ser Pro 

Ser Ser 
220 

Pro Gly 
235 

Arg Tyr 



Glu Glu 

Thr Leu 
30 

Leu Arg 
45 

Gin Gly 

Gly Ar.g 

Ser Arg 

lie Lys 
110 

Met His 
125 

Thr Phe 

Ala His 

His Cys 

Val Leu 
190 

Ser Pro 
205 

Asn Ser 
Ser Asp 
Ala Pro 



Glu Val 
15 

Thr Pro 
Arg Pro 
Val Gin 



Leu Leu 
80 

Ala Val 
95 

Lys Thr 



Asn Leu 



Pro Glu 



Asn Tyr 
160 

Ala Gly 
175 

Leu Ser 

Pro Ser 

Thr Ser 

Val Asp 
240 

His Leu 
255 



<400> 8 



gatacacaca 


gatctagagg 


ctccaggaga 


cgatgcgaca 


ctcagcctga 


aaagatttgg 


60 


aagatccaaa 


atgaaaactg 


attattgaat 


gaaattaaaa 


cctaaggtaa 


attaaggtta 


120 


aagaaccatg 


ttaacactac 


cgtttgacga 


gtctgtcgta 


atgcccgaat 


cccagatgtg 


180 


cagaaagttt 


gctagacaat 


gtgaggacca 


gaaacaaatt 


aagaaaccag 


agagctttcc 


240 


aaaacaagtt 


gtccttcgag 


gaaagagcat 


taaaagggcc 


cctggagaag 


aaaccgagaa 


300 


agaagaggag 


gaagaagaca 


gagaggaaga 


agatgagaat 


ggcttgtcca 


gaaggagggg 


360 


gctcaggaaa 


aaaaagacca 


ccaaactacg 


actggaaagg 


gtcaagttca 


ggagacagga 


420 


agctaatgcg 


cgcgagagga 


accggatgca 


cggcctcaat 


gatgctctgg 


acaatttgcg 


480 


aaaagtggtc 


ccctgttact 


ctaaaaccca 


aaaactgtcc 


aaaatagaaa 


ctttacgact 


540 


ggccaaaaat 


tacatctggg 


cactttctga 


aattctgagg 


attggcaaga 


gaccggatct 


600 


gctcacgttc 


gtccaaaact 


tatgcaaagg 


tctttcccag 


ccaactacaa 


acttggtggc 


660 


aggctgctta 


cagctcaacg 


ccagaagttt 


cctgatgggt 


cagggtgggg 


aggctgccca 


720 


ccacacaagg 


tcaccctact 


ccacattcta 


cccaccctac 


cacagccctg 


agctggccac 


780 


tcccccaggg 


catgggactc 


ttgataattc 


caagtccatg 


aaaccctaca 


attactgcag 


840 


tgcatatgaa 


tccttctatg 


aaagtacctc 


ccctgagtgt 


gccagccctc 


agtttgaagg 


900 


tcccttaagt 


cct cccccaa 


ttaactataa 


tgggatattt 


tccctgaagc 


aagaagaaac 


960 


cttggactat 


ggcaaaaatt 


acaattatgg 


catgcattac 


tgtgcagtgc 


cacccagggg 


1020 


tccccttggg 


cagggtgcca 


tgttcaggtt 


gcccaccgac 


agccacttcc 


cttacgactt 


1080 


acatctgcgc 


agccaatctc 


tcactatgca 


agatgaatta 


aatgcagttt 


ttcataatta 


1140 


atgaggaaaa 


tgaaaataaa 


cagtggt cat 


tcacctccca 


ctctaattaa 


ggcaaagcag 


1200 


atgcttgtgg 


gctgcgtaat 


tggcacaact 


ctatctaagg 


tgtttactag 


tttctgaagt 


1260 


gtgtctcaaa 


gattgtgacc 


attttctatg 


tcataataaa 


tcccttttcg 


tatgagaact 


1320 


tcctttcctt 


ccctcttgtc 


tgtatcacac 


tgtgattctc 


tctctctctc 


tctctctctc 


1380 


tctctctctc 


tctctctctt 


actggcagaa 


tatttctttc 


ttgttttagt 


ttctttcaaa 


1440 


ttcacttaat 


ttgtttgaac 


aaggtgtcta 


agatgttgct 


gaataaagac 


atgcacacag 


1500 


catacttcaa 


tgtctatttc 


agttgtacag 


ttatgatgaa 


aatgcatgtt 


ataaaaatca 


1560 


gatgagtaaa 


atgtgtttat 


aattactagg 


attcatatat 


gtatctctga 


aattttagtt 


1620 


tttaaaatat 


taagagctaa 


ccatgaaatt 


aaaaggtgca 


tttggggatg 


cacaacggta 


1680 


tcaaaagcta 


tgcaattttc 


tgtttattag 


ggacaaaaat 


aagtgtattc 


agttggtaac 


1740 


aactattcct 


cttcaagcat 


tttcagagga 


ggaaacacgg 


tatttggggg 


aggttatcag 


1800 



tgtcataatt tggggacaat taattcaatc atgaagaaaa aaaatattag cacttgtttt 
gtattgttca ggatttttct gtacaggttt gttacagtgt ataattgtgt tttccatcct 
acagtttaaa agcaattaaa catagatatt tccactt 



1860 
1920 
1957 



<210> 9 

<211> 337 

<212> PRT 

<213> MOUSE 

<400> 9 

Met Leu Thr Leu Pro Phe Asp Glu Ser Val Val Met Pro Glu Ser Gin 
15 10 15 

Met Cys Arg Lys Phe Ala Arg Gin Cys Glu Asp Gin Lys Gin lie Lys 
20 25 30 

Lys Pro Glu Ser Phe Pro Lys Gin Val Val Leu Arg Gly Lys Ser lie 
35 40 45 

Lys Arg Ala Pro Gly Glu Glu Thr Glu Lys Glu Glu Glu Glu Glu Asp 
50 55 60 

Arg Glu Glu Glu Asp Glu Asn Gly Leu Ser Arg Arg Arg Gly Leu Arg 
65 70 75 80 

Lys Lys Lys Thr Thr Lys Leu Arg Leu Glu Arg Val Lys Phe Arg Arg 
85 90 95 

Gin Glu Ala Asn Ala Arg Glu Arg Asn Arg Met His Gly Leu Asn Asp 
100 105 110 

Ala Leu Asp Asn Leu Arg Lys Val Val Pro Cys Tyr Ser Lys Thr Gin 
115 120 125 

Lys Leu Ser Lys lie Glu Thr Leu Arg Leu Ala Lys Asn Tyr lie Trp 
130 135 140 

Ala Leu Ser Glu lie Leu Arg lie Gly Lys Arg Pro Asp Leu Leu Thr 
145 150 155 160 

Phe Val Gin Asn Leu Cys Lys Gly Leu Ser Gin Pro Thr Thr Asn Leu 
165 170 175 

Val Ala Gly Cys Leu Gin Leu Asn Ala Arg Ser Phe Leu Met Gly Gin 
180 185 190 

Gly Gly Glu Ala Ala His His Thr Arg Ser Pro Tyr Ser Thr Phe Tyr 
195 200 205 

Pro Pro Tyr His Ser Pro Glu Leu Ala Thr Pro Pro Gly His Gly Thr 
210 215 220 

Leu Asp Asn Ser Lys Ser Met Lys Pro Tyr Asn Tyr Cys Ser Ala Tyr 
225 230 235 240 



Glu Ser Phe Tyr Glu Ser Thr Ser Pro Glu Cys Ala Ser Pro Gin Phe 
245 250 255 



Glu Gly Pro Leu Ser Pro Pro Pro lie Asn Tyr Asn Gly lie Phe Ser 
260 265 270 



Leu Lys Gin Glu Glu Thr Leu Asp Tyr Gly Lys Asn Tyr Asn Tyr Gly 
275 280 285 

Met His Tyr Cys Ala Val Pro Pro Arg Gly Pro Leu Gly Gin Gly Ala 
290 295 300 

Met Phe Arg Leu Pro Thr Asp Ser His Phe Pro Tyr Asp Leu His Leu 
305 310 315 320 

Arg Ser Gin Ser Leu Thr Met Gin Asp Glu Leu Asn Ala Val Phe His 
325 330 335 

Asn 

<210> 10 
<211> 1056 
<212> DNA 
<213> MOUSE 

<400> 10 

atgtcccgcc tgctgcatgc agaagagtgg gctgaggtaa aagagttggg ggaccaccat 60 

cgccatcccc agccgcacca cgtcccgccg ctgacgccac agccacctgc taccctgcag 120 

gcgagagacc ttcccgtcta cccggcagaa ctgtccctcc tggatagcac cgacccacgc 180 

gcctggctga ctcccacttt gcagggcctc tgcacggcac gcgccgccca gtatctgctg 240 

cattctcccg agctgggtgc ctccgaggcc gcggcgcccc gggacgaggc tgacagccag 300 

ggtgagctgg taaggagaag cggctgtggc ggcctcagca agagccccgg gcccgtcaaa 360 

gtacgggaac agctgtgcaa gctgaagggt ggggttgtag tggacgagct tggctgcagc 420 

cgccagcgag ccccttccag caaacaggtg aatggggtac agaagcaaag gaggctggca 480 

gcaaacgcaa gggaacggcg caggatgcac gggctgaacc acgccttcga ccagctgcgc 540 

aacgttatcc cgtccttcaa caacgacaag aagctgtcca aatatgagac cctacagatg 600 

gcccagatct acatcaacgc tctgtcggag ttgctgcaga ctcccaatgt cggagagcaa 660 

ccgccgccgc ccacagcttc ctgcaaaaat gaccaccatc accttcgcac cgcctcctcc 720 

tatgaaggag gtgcgggcgc ctctgcggta gctggggctc agccagcccc gggagggggc 780 

ccgagaccta ccccgcccgg gccttgccgg actcgcttct caggcccagc ttcctctggg 84 0 

ggttactcgg tgcagctgga cgctttgcac ttcccagcct tcgaggacag ggccctaaca 900 

gcgatgatgg cacagaagga cctgtcgcct tcgctgcccg ggggcatcct gcagcctgta 960 

caggaggaca acagcaaaac atctcccaga tcccacagaa gtgacggaga gttttccccc 1020 

cactctcatt acagtgactc tgatgaggcc agttag 1056 



<210> 11 

<211> 351 

<212> PRT 

<213> MOUSE 

<400> 11 

Met Ser Arg Leu 
1 

Gly Asp His His 
20 

Pro Gin Pro Pro 
35 

Ala Glu Leu Ser 
50 

Pro Thr Leu Gin 
65 

His Ser Pro Glu 



Ala Asp Ser Gin 
100 

Ser Lys Ser Pro 
115 

Lys Gly Gly Val 
130 

Pro Ser Ser Lys 
145 

Ala Asn Ala Arg 



Asp Gin Leu Arg 
180 

Ser Lys Tyr Glu 
195 

Ser Glu Leu Leu 
210 

Thr Ala Ser Cys 
225 

Tyr Glu Gly Gly 



Pro Gly Gly Gly 

260 

Phe Ser Gly Pro 
275 



Leu His 
5 

Arg His 

Ala Thr 

Leu Leu 

Gly Leu 
70 

Leu Gly 
85 

Gly Glu 

Gly Pro 

Val Val 

Gin Val 
150 

Glu Arg 
165 

Asn Val 

Thr Leu 

Gin Thr 

Lys Asn 
230 

Ala Gly 
245 

Pro Arg 
Ala Ser 



Ala Glu 

Pro Gin 

Leu Gin 
40 

Asp Ser 
55 

Cys Thr 

Ala Ser 

Leu Val 

Val Lys 
120 

Asp Glu 
135 

Asn Gly 

Arg Arg 

lie Pro 

Gin Met 
200 

Pro Asn 
215 

Asp His 

Ala Ser 

Pro Thr 

Ser Gly 
280 



Glu Trp 
10 

Pro His 
25 

Ala Arg 



Thr Asp 



Ala Arg 



Glu Ala 
90 

Arg Arg 
105 

Val Arg 



Leu Gly 



Val Gin 

Met His 
170 

Ser Phe 
185 

Ala Gin 

Val Gly 

His His 

Ala Val 
250 

Pro Pro 
265 

Gly Tyr 



Ala Glu 

His Val 

Asp Leu 

Pro Arg 
60 

Ala Ala 
75 

Ala Ala 

Ser Gly 

Glu Gin 

Cys Ser 
140 

Lys Gin 
155 

Gly Leu 

Asn Asn 

lie Tyr 

Glu Gin 
220 

Leu Arg 
235 

Ala Gly 
Gly Pro 
Ser Val 



Val Lys 

Pro Pro 
30 

Pro Val 
45 

Ala Trp 

Gin Tyr 

Pro Arg 

Cys Gly 
110 

Leu Cys 
125 

Arg Gin 

Arg Arg 

Asn His 

Asp Lys 
190 

He Asn 
205 

Pro Pro 

Thr Ala 

Ala Gin 

Cys Arg 
270 

Gin Leu 
285 



Glu Leu 
15 

Leu Thr 
Tyr Pro 
Leu Thr 



Leu Leu 
80 

Asp Glu 
95 

Gly Leu 

Lys Leu 

Arg Ala 

Leu Ala 
160 

Ala Phe 
175 

Lys Leu 
Ala Leu 



Pro Pro 

Ser Ser 
240 

Pro Ala 
255 

Thr Arg 
Asp Ala 



Leu His Phe Pro Ala Phe Glu Asp Arg Ala Leu Thr Ala Met Met Ala 
290 295 300 

Gin Lys Asp Leu Ser Pro Ser Leu Pro Gly Gly lie Leu Gin Pro Val 
305 310 315 320 

Gin Glu Asp Asn Ser Lys Thr Ser Pro Arg Ser His Arg Ser Asp Gly 
325 330 335 

Glu Phe Ser Pro His Ser His Tyr Ser Asp Ser Asp Glu Ala Ser 
340 345 350 

<210> 12 
<211> 3261 
<212> DNA 
<213> MOUSE 



<400> 12 



aatctataca 


tggcaaaaat 


gtatatgaaa 


tccaaggaca 


tggtggagct 


ggtcaacaca 


60 


caatcctgga 


tggacaaagg 


tctgagctct 


caaaatgaga 


tgaaggagca 


agagagaaga 


120 


ccgggctctt 


atggaatgct 


cggaacctta 


actgaagagc 


atgacagtat 


tgaggaggat 


180 


gaagaagagg 


aagaagatgg 


agataaacct 


aaaagaagag 


gtcccaagaa 


aaagaagatg 


240 


actaaagctc 


gccttgaaag 


attcagggct 


cgaagagtca 


aggccaatgc 


tagagaacgg 


300 


acccggatgc 


atggcctgaa 


tgatgccttg 


gataatctta 


ggagagtcat 


gccatgttac 


360 


tctaaaactc 


aaaagctttc 


caagatagag 


actcttcgac 


tggcaaggaa 


ctacatctgg 


420 


gccttgtctg 


aagtcctgga 


gactggtcag 


acacttgaag 


ggaagggatt 


tgtagagatg 


480 


ctatgtaaag 


gtctctctca 


acccacaagc 


aacctggttg 


ctggatgcct 


ccaactgggg 


540 


cctcaatcta 


ccctcctgga 


gaagcatgag 


gaaaaatctt 


caatttgtga 


ctctactatc 


600 


tctgtccaca 


gcttcaacta 


tcagtctcca 


gggctcccca 


gccctcctta 


tggccatatg 


660 


gaaacacatt 


ctctccatct 


caagcctcaa 


ccatttaaga 


gtttgggtga 


ctcttttggg 


720 


agccatccac 


ctgactgcag 


taccccccct 


tatgagggtc 


cactcacacc 


acccctgagc 


780 


attagtggca 


acttctcctt 


aaagcaagac 


ggctcccctg 


atttggaaaa 


atcctacaat 


840 


ttcatgccac 


attatacctc 


tgcaagtcta 


agttcagggc 


atgtgcattc 


aactcccttt 


900 


cagactggca 


ctccccgcta 


tgatgttcct 


gtagacctga 


gctatgattc 


ctactcccac 


960 


catagcattg 


gaactcagct 


caatacgatc 


ttctctgatt 


agagcaataa 


gataagcacc 


1020 


aatatttcag 


agaatgaagt 


ggagattttt 


ttcacatttc 


tagtggctga 


gctaaactct 


1080 


cagaaaattt 


aaaagaacct 


ttggatatgc 


atcaaacata 


atagtcctag 


tttgttcaga 


1140 


acttcctgta 


cctgctaact 


ttcttcccat 


taacttctca 


cattggacca 


gtcctacatt 


1200 


tggtaaactt 


aagtgaatat 


atttgatggt 


ttgaggccac 


atggtaatag 


aacagaaaga 


1260 



aagcccaggc 


cctgttccaa 


tggtgccaaa 


gattaattga 


atgctctgcc 


aattaacttt 


1320 


ccatttccag 


tgtttttatt 


gctttctgat 


aaacatgaag 


caactgttcc 


aaatcaacat 


1380 


atacctttca 


cctcccacac 


atttttaaat 


ttaaaagcaa 


tcaaagcaaa 


tagcaaaaat 


1440 


ggaatgatta 


tacagaactg 


gaaggagcat 


caagtacatg 


tctgttggct 


tatagaatac 


1500 


aaaatttgtg 


tgaatttgac 


aaatcatctt 


tgtgtctatt 


taaatataac 


ttccagagcc 


1560 


taaaataata 


acttggatgt 


taccataaat 


aaccagtatg 


ttctttaaga 


gatcagctct 


1620 


acttattact 


gtgctgaaag 


gtatacacac 


cttttttagt 


gattggagaa 


ccatgataga 


1680 


agcctcacac 


aaactttatt 


ctttatacta 


tttaaaaaac 


aactgtctta 


gtactaggag 


1740 


acaagcaaga 


aagacattga 


aatttctctt 


ctggcacaca 


gaaatatcac 


ctagctcatt 


1800 


tcttagctcc 


cgtgactata 


gggctgtaga 


attttgcagg 


tattcaggtg 


cttcagttag 


1860 


aatcagaact 


cagccaggtt 


catactgtag 


agcaataaaa 


tggtggttgg 


ctgctatccc 


1920 


aactaacaca 


gttaaagaac 


tctgcctcac 


acatagccac 


tggaaaaatg 


tggatattct 


1980 


ccagcaagat 


gaatttcact 


gtttaaagca 


atgcaattaa 


agccatagag 


tttcgtccac 


2040 


tccagtatca 


tatattccag 


aactgttgta 


atcaatcttg 


aattcttaca 


acataaatgc 


2100 


aactccttac 


ttcccaccta 


acactgattg 


ttatattgtc 


ttcaattcca 


agatattatg 


2160 


caattatatg 


caaaattttg 


attagaatca 


aaattaagag 


tcaatgaatc 


tgtctgtatc 


2220 


ttcaggacgg 


gttttgatca 


gttttaagaa 


agtttatttt 


cctttatgtg 


gcatctcttt 


2280 


tctttgtaac 


cacactggtt 


cagccaagtt 


tctcttctcc 


agagaaatta 


gctctgagaa 


2340 


attttactat 


catgatccat 


cttccacagc 


aattatttag 


gttcaactca 


agagtataca 


2400 


tagtttattt 


atagtgggtg 


aggatacacc 


tccaagaata 


aattttaaca 


acattaatga 


2460 


catatgaata 


tgccatttta 


tctaccaaac 


tatatatgta 


tctcttttct 


ttattgccct 


2520 


tatttatttc 


ttcacactga 


gagttatttt 


gtgtccatct 


tattgcagca 


cttactctgc 


2580 


tctactttgc 


acctttggat 


tataaatatg 


tttaaaagtc 


tgtaaagacg 


tcttaaacaa 


2640 


ctcgt gacag 


taattcacca 


cccctaagac 


cttgaatcac 


cctagtggaa 


ataggcaagg 


2700 


agaattattt 


atagaatcat 


cctatgtaat 


tttttttgag 


aatttgctct 


acctagcatt 


2760 


tatgtttata 


gataattgct 


atctgcatta 


tttattaggt 


tctatttatt 


taatttatct 


2820 


ttctttcttt 


ttatgtaaac 


atttgtgccc 


catagatata 


gcctcaaagc 


ttcactggga 


2880 


aactagctta 


tatgtttgga 


gtgagagaaa 


aggagaaaat 


cagttcttga 


ttgcttgcaa 


2940 


tggttttata 


aaacagagca 


ataatttgaa 


tagatatgca 


acttaatggt 


tttagaattt 


3000 


ttcctttaag 


gtgcaacaga 


gttacattat 


tatttatgac 


tttggagaat 


gtagtacatg 


3060 


tgaaccagga 


ctgtaggctt 


gtgaagagag 


attttataat 


taaatacaaa 


tttagtactg 


3120 



tactatgctt ggaaagaact tgttctttaa ataatgttta gtcttctggg agtgttttca 
gataaaatga agcaattgtt taaaagaaat ggctgttttc cctccctttt ccagtagcaa 
taaagctttg agtgttatta c 



3180 
3240 
3261 



<210> 13 

<211> 501 

<212> DNA 

<213> ZEBRA FISH 

<220> 

<221> misc_f eature 

<222> (115) . . (115) 

<223> "n" can be any nucleotide 



<400> 13 

atgttaacgg taccgtttga agagccagat atgatgcgcg agtctcagtt tggcgccacg 60 

ttcacgcgtc aggaagacgt ccggacactc agcagcgccg agctcaagga ggcanaggac 120 

gacaacacgg acagggagga ggaggaggag agagaggagg actataacgg gctgccaaag 180 

aagaagggtc cccgcaaaaa gaaatccgag ggacgcggtg accgagtcaa aatgcgccgt 240 

caggaagcaa acgcgcgtga gcgcagccgc atgcacggtc taaacgacgc gctcgaaagc 300 

ctgcgcaaag tcgtgccgtg ctactccaaa acgcagaaac tctccaagat cgaaaccctg 360 

gggctgggca agaattacat ttgggctctg tctgagactt tgagcgcagg aaagcgacct 420 

ggcctgcttg ggttggtagc aaccctgggc gtgggctggt ctagaggaca gaccagcttg 480 

gtgggggagt gcctgcagct a 501 

<210> 14 
<211> 609 
<212> DNA 
<213> HUMAN 

<400> 14 

tttgacattg aattatgctg tgtgcatgtc tttattcaac agtatattct tagacacctt 60 

gttcaaacaa attaaataag tttaaaagaa aattaaaaag aaaaaaatct ttccagtaga 120 

aacagaatca cagtgcttca cagacaaaag gaaaggaaaa gaagttctca tacgaaaaga 180 

gatttattat tacatagaaa attctcacaa tagttgaaac acacttcaga aactagtaaa 240 

caccttagat agagttgtgc caattactca gcccacaagc atctgctttg tcttaattag 300 

acaggggagg tgaatgacca ctgtttattt tcattttcct cattaattat gaaaaactgc 360 

atttaattca tcttgcattg tgagagattg gctgcgcaga tgtaagtcgt aagggaagtg 420 

gctgtcggtg ggcaacctga acatggcacc ctgcccaagg ggacccctgg gtggcactgc 4 80 



acagtaatgc 


atgccgtaat 


tgtaattttt 


accatagtcc 


aaggtttctt 


cttgcttcag 


540 


ggaaaatatc 


ccattatagg 


taattggggg 


aggacttaag 


ggaccttcaa 


actgagggct 


600 


ggcacactc 












609 


<210> 
<211> 
<212> 
<213> 


15 
675 
DNA 
HUMAN 












<400> 15 
tagagagcgg 


caggagacga 


tgcgacactc 


agcctgaaaa 


gatttggaag 


acccaaaatg 


60 


aaaactgatt 


attgaatgaa 


attaaaacct 


aaggtaattt 


aagattagag 


aaccatgtta 


120 


acactaccgt 


ttgatgagtc 


tgttgtaatg 


ccagaatccc 


agatgtgcag 


aaagttttct 


180 


agagaatgcg 


aggaccagaa 


gcaaattaag 


aagccagaaa 


gcttttccaa 


acagattgtc 


240 


cttcgaggaa 


agagcatcaa 


aagggcccct 


ggagaagaaa 


ccgagaaaga 


agaagaggag 


300 


gaagacaggg 


aagaggaaga 


tgaaaatggg 


ttgcctagaa 


ggaggggtct 


taggaaaaaa 


360 


aagacaacaa 


agctgcgatt 


ggaaagggtc 


aagttcagga 


gacaggaagc 


gaacgcgcgc 


420 


gagaggaaca 


ggatgcacgg 


cctcaacgac 


gctctggaca 


acttaagaaa 


agtggtccgc 


480 


tgttattcta 


tgacccagag 


actgtccaaa 


atagagactg 


tactactggc 


caaaaactac 


540 


atctgggcac 


tgtctgatat 


gctgagaatc 


ggcaagagac 


cagatctgct 


cacgattcgg 


600 


caaagctgat 


gcatagggtc 


ttgccagcca 


actacagact 


tggtggcagg 


ctggtcgcag 


660 


ctcaacgaca 


ggagt 










675 


<210> 
<211> 
<212> 
<213> 


16 

1476 
DNA 

DROSPHILA 












<220> 
<221> 
<222> 
<223> 


misc feature 
(203) . . (203) 
fr n" can be any 


nucleotide 










<220> 
<221> 
<222> 
<223> 


misc feature 
(217) . . (217) 
"n" can be any 


nucleotide 










<400> 16 
ctccaccctc 


gtgtgccgtt 


gagcgtgaaa 


gtcacaaatt 


atgggcgaca 


gagggagagg 


60 


gagagagaga 


gaggggaaga 


tcccatggca 


tgatagtacc 


aatttggcaa 


gatatgacgc 


120 


gtgcctggcg 


cgagagagat 


gacaacaggc 


gaagtgtagg 


cgtttcacca 


ccgagcgaaa 


180 



gggagggaaa catatctaac ccntaagtca ccccagntcc tcggggagat cctttggccg 240 

ggccatggta atagctgcga ggatcctggg gccaggggtc atttgcgggc catttacaca 300 

aacttggtca gaacggcggt gaaaatattt gtagaatgca tctcgcgggg gttgaccgta 360 

gtcaagtctg gatccgatcg gatcgtttca gttgcaacga aactttcaag ccgcgcggat 420 

ctgcaatgag ctactactac tcgtctgcct ccgaggagga tggcagttcc cagtatctgg 480 

gcagtcccaa ctacaacttg acccagttgc cgccagtttc tggccaggat tacggacagg 540 

gggctttctt atcgccggaa tggcaattct tggatgccgc aggcggaact caaacggaac 600 

taggacccat aatggaggtg caaggacagc acacccagcc gcagaccaaa cggcggagta 660 

acagctccac aggatcggac ggtaggaaga gcagtccaga gcagaccaat ctcagtccca 720 

cggtccagaa gagaagacga caggctgcca atgcgcggga aaggaagcgg atgaatggat 78 0 

taaatgcggc tttcgagcgc ctaagggaag tggtgcccgc tccgtccatt gaccagaaat 840 

tgtccaagtt cgagactctc cagatggccc agtcctacat cctggcgctg tgcgatctcc 900 

tcaacaacgg ggacgtggaa gtggatgccg ctgcatacac catcttcggc gacagcgata 960 

gtggatttgg attgagcgga ggatccttgt catagatgga tgatactaga ctaaagttat 1020 

gtgattttct tcttactgta gattaagtta aatatgtaat gaaataaatt gaaatgttta 1080 

attgaatgct aaaaataatt tcaatttcaa aggcattatt catggaacgc atcgcttgat 1140 

tgtgaaccaa gtgttcaatc caaaatgagc tactcatcta accctttcta ccaattataa 1200 

ccagcccact actatacaca gatccccaga ttcaattggc catcagatcg tttggctgcg 1260 

cctcaatgat ttcgaaaacg atcataaatt aattactcaa cagttcattg ataggcgtac 1320 

ttcaagtttt grtctaaccc cgaacgattc cgaaactgcg aagccaccac agtgggcggt 1380 

gtagcgtgac acccctgaac tcctggccag aactcctctc attgaataaa aaaggcagct 1440 

gagtccttcg gcttcggttg ggtgctgttt tttttt 1476 

<210> 17 
<211> 189 
<212> PRT 
<213> DROSOPHILA 

<400> 17 

Met Ser Tyr Tyr Tyr Ser Ser Ala Ser Glu Glu Asp Gly Ser Ser Gin 
15 10 15 

Tyr Leu Gly Ser Pro Asn Tyr Asn Leu Thr Gin Leu Pro Pro Val Ser 
20 25 30 

Gly Gin Asp Tyr Gly Gin Gly Ala Phe Leu Ser Pro Glu Trp Gin Phe 



Leu Asp Ala Ala Gly Gly Thr Gin Thr Glu Leu Gly Pro lie Met Glu 
50 55 60 

Val Gin Gly Gin His Thr Gin Pro Gin Thr Lys Arg Arg Ser Asn Ser 
65 70 75 80 

Ser Thr Gly Ser Asp Gly Arg Lys Ser Ser Pro Glu Gin Thr Asn Leu 
85 90 95 

Ser Pro Thr Val Gin Lys Arg Arg Arg Gin Ala Ala Asn Ala Arg Glu 
100 105 110 

Arg Lys Arg Met Asn Gly Leu Asn Ala Ala Phe Glu Arg Leu Arg Glu 
115 120 125 

Val Val Pro Ala Pro Ser lie Asp Gin Lys Leu Ser Lys Phe Glu Thr 
130 135 140 

Leu Gin Met Ala Gin Ser Tyr lie Leu Ala Leu Cys Asp Leu Leu Asn 
145 150 155 160 

Asn Gly Asp Val Glu Val Asp Ala Ala Ala Tyr Thr He Phe Gly Asp 
165 170 175 

Ser Asp Ser Gly Phe Gly Leu Ser Gly Gly Ser Leu Ser 
180 185 

<210> 18 
<211> 1074 
<212> DNA 
<213> CHICKEN 

<400> 18 

tttttccttt cccttttcct cgcggagccg cggccgcttt gccttcccct acagcgagga 60 

gctccgcttt ctcctctatt tatttctaaa ccgttattgc ttaaccgcag ccccgggagt 120 

gcctttccat gtgagtacgg gggacacagg cacctgcgat gcggtgcggc cgtgccaagc 180 

actcccaccc tccctccgtc cctccttggg gttactttgg gatatttttt ccctcccttt 240 

ccccctcccc tcagcacctt ccccccgctc cccctgccgc ccactgacgg ctccggcttc 300 

tctcccgcag gatgcccgcg gaggcggcga gcagcggcgg cgtttcggag ccgcccggag 360 

ctccgcggga gcggcggagg agacgcggcc gtgcgcgggc gcggaccgag gctttgctgc 420 

acaccctcaa acggagccgc cgggtgaaag ccaacgaccg ggagcggaac cgcatgcacc 480 

acctcaacgc cgcgctggat gagctccgca gcgtcctgcc gaccttcccc gacgacacca 54 0 

aactcaccaa aatcgaaacc ctgcgcttcg cttacaacta catctgggcc ctctccgaga 600 

cccttcgttt ggccgagcag tgcctccctc ctccccccgc cttccgcggg ccccccgcgc 660 

ccoccagccc cggcagcgac gccggttcgt ggctgtccag cggttccccg gccgccccct 720 

cgctctgcgc ctccgcctcc ggccccagca gcccggccac ctccgaggac tgcggctacg 780 

tcccctcgga cgccctgcgg gccttccgcg ggctgccccc cgccgccccg ggcgctccct 840 



gccgctagcc ctgcccgtgc gtgtctccgt ccccccccac cttctccgta tcccgttgca 900 

cttttcagcc cctcccgccc cccagcccct ctctccgggg tgccctttcc cttcgccccc 960 

ccgcctcgtt ttccatacga cttggaaaac ccggcaaaga aaagcgacag atttgctgcc 1020 

gcagacgagg tgaaaagtca attttacaat ttgtagctct ccggtgaaga aaaa 1074 



<210> 19 

<211> 178 

<212> PRT 

<213> CHICKEN 

<400> 19 

Met Pro Ala Glu Ala Ala Ser Ser Gly Gly Val Ser Glu Pro Pro Gly 
15 10 15 

Ala Pro Arg Glu Arg Arg Arg Arg Arg Gly Arg Ala Arg Ala Arg Thr 
20 25 30 

Glu Ala Leu Leu His Thr Leu Lys Arg Ser Arg Arg Val Lys Ala Asn 
35 40 45 

Asp Arg Glu Arg Asn Arg Met His His Leu Asn Ala Ala Leu Asp Glu 
50 55 60 

Leu Arg Ser Val Leu Pro Thr Phe Pro Asp Asp Thr Lys Leu Thr Lys 
65 70 75 80 

lie Glu Thr Leu Arg Phe Ala Tyr Asn Tyr He Trp Ala Leu Ser Glu 
85 90 95 

Thr Leu Arg Leu Ala Glu Gin Cys Leu Pro Pro Pro Pro Ala Phe Arg 
100 105 110 

Gly Pro Pro Ala Pro Pro Ser Pro Gly Ser Asp Ala Gly Ser Trp Leu 
115 120 125 

Ser Ser Gly Ser Pro Ala Ala Pro Ser Leu Cys Ala Ser Ala Ser Gly 
130 135 140 

Pro Ser Ser Pro Ala Thr Ser Glu Asp Cys Gly Tyr Val Pro Ser Asp 
145 150 155 160 

Ala Leu Arg Ala Phe Arg Gly Leu Pro Pro Ala Ala Pro Gly Ala Pro 
165 170 175 

Cys Arg 



<210> 20 

<211> 790 

<212> DNA 

<213> CHICKEN 



<400> 20 

gcagccgccg ggcccccggg ccgctccgac gccgcccggg 



cgcgtgccag cgccccccga 



60 



ccgcccgacg gccgtcgacc agcggccgcg cccccctctc ccccgccccc gcaggatgcc 120 

ggtgaaggcg gagagcccgg cgcccgcggc ggaggacgaa ctgctgctgc tgcgcctcgc 180 

ctcgcccgcc ccctcggcct cgctgccgtc cagcgccggc gaggaggacg aggacgagga 24 0 

ggacgggcgg ccgcggcggc tgcaggaggg cgctcggcgg gcggggcggc agcgagggcc 30 0 

cccgcgggcg gcgcgcacgg cggagacggc gcagcgcatc aagcggagcc ggcggctgaa 360 

agccaacaac cgcgagcgca accgcatgca caacctgaac gcggcgctgg acgcgctgcg 420 

cgacgtgctg cccaccttcc ccgaggacgc caagctcacc aagatcgaga cgctgcgctt 480 

cgcccacaac tacatctggg cgctcaccga gacgctgcgc ctggccgggg ccgcccgcct 540 

ggggggcgcc gccgacgccg cgcccggggc ggccgccgag ggcagcccct cgcccgcctc 600 

gtcgtggagc ggcggcgcca gccccgcgcc ctccgcctcg ccctacgcct gcactttatc 660 

gcccggcagc cccgccggct ccgcctcgga cgccgagcac tggccgcccc cgcggggccg 720 

cttcgccccg ccgccgccgc cccaccgctg cctctaacgc ggcccgggcg gccctcgctc 780 

ctccgacgtg 790 

<210> 21 

<211> 213 

<212> PRT 

<213> CHICKEN 

<400> 21 

Met Pro Val Lys Ala Glu Ser Pro Ala Pro Ala Ala Glu Asp Glu Leu 
15 10 15 

Leu Leu Leu Arg Leu Ala Ser Pro Ala Pro Ser Ala Ser Leu Pro Ser 
20 25 30 

Ser Ala Gly Glu Glu Asp Glu Asp Glu Glu Asp Gly Arg Pro Arg Arg 
35 40 45 

Leu Gin Glu Gly Ala Arg Arg Ala Gly Arg Gin Arg Gly Pro Pro Arg 
50 55 60 

Ala Ala Arg Thr Ala Glu Thr Ala Gin Arg lie Lys Arg Ser Arg Arg 
65 70 75 80 

Leu Lys Ala Asn Asn Arg Glu Arg Asn Arg Met His Asn Leu Asn Ala 
85 90 95 

Ala Leu Asp Ala Leu Arg Asp Val Leu Pro Thr Phe Pro Glu Asp Ala 
100 105 110 

Lys Leu Thr Lys He Glu Thr Leu Arg Phe Ala His Asn Tyr He Trp 
115 120 125 

Ala Leu Thr Glu Thr Leu Arg Leu Ala Gly Ala Ala Arg Leu Gly Gly 
130 135 140 





Ala Ala Asp 
145 


Ala 


Ala 


Pro Gly Ala Ala Ala Glu 
150 155 


Gly Ser Pro Ser Pro 
160 






Ala Ser Ser 


Trp 


Ser 
165 


Gly Gly Ala Ser Pro Ala 
170 


Pro Ser Ala Ser Pro 
175 






Tyr Ala Cys 


Thr 
180 


Leu 


Ser Pro Gly Ser Pro Ala 
185 


Gly Ser Ala Ser Asp 
190 






Ala Glu His 
195 


Trp 


Pro 


Pro Pro Arg Gly Arg Phe 
200 


Ala Pro Pro Pro Pro 
205 






Pro His Arg 
210 


Cys 


Leu 










<210> 22 
<211> 450 
<212> DNA 
<213> MOUSE 












<400> 22 
atgaagtcgg 


cctgcaaacc ccacggccct ccggcgggag 


ctcgcggcgc gcccccgtgc 


60 




gcgggcgcag 


ccgagcgcgc ggtctcgtgc gcggggcccg 


ggcggctgga gagcgcggcg 


120 


Q 


cgcaggcgtc 


tggcggccaa cgcgcgcgag cggcgccgca 


tgcaggggct gaacacggcg 


180 


iSSSBS 


ttcgaccggc 


tgcgcagggt ggtgccgcag tggggccagg 


acaagaagct gtccaagtac 


240 




gagacactgc 


agatggcgct cagctacatc atcgcgctca 


cccgcatcct agccgaagcc 


300 




gagcgggact 


gggtcgggct gcgctgcgag cagcggggcc 


gcgatcaccc ctacctccct 


360 




ttcccgggtg 


ctaggctcca ggtagaccct gagccctatg 


ggcagaggct cttcggcttc 


420 




cagccggagc 


ccttccccat ggccagctaa 




450 




<210> 23 
<211> 149 
<212> PRT 
<213> MOUSE 












<400> 23 














Met Lys Ser 
1 


Ala 


Cys 
5 


Lys Pro His Gly Pro Pro 
10 


Ala Gly Ala Arg Gly 
15 






Ala Pro Pro 


Cys 
20 


Ala 


Gly Ala Ala Glu Arg Ala 
25 


Val Ser Cys Ala Gly 
30 






Pro Gly Arg 
35 


Leu 


Glu 


Ser Ala Ala Arg Arg Arg 
40 


Leu Ala Ala Asn Ala 
45 






Arg Glu Arg Arg 
50 


Arg 


Met Gin Gly Leu Asn Thr 
55 


Ala Phe Asp Arg Leu 
60 






Arg Arg Val 
65 


Val 


Pro 


Gin Trp Gly Gin Asp Lys 
70 75 


Lys Leu Ser Lys Tyr 
80 






Glu Thr Leu 


Gin 


Met 


Ala Leu Ser Tyr lie lie 


Ala Leu Thr Arg lie 





85 



90 



95 



Leu Ala Glu Ala 
100 



Glu Arg Asp Trp Val Gly Leu Arg Cys Glu Gin Arg 
105 110 



Gly Arg Asp His 
115 



Pro Tyr Leu Pro Phe Pro Gly Ala Arg Leu Gin Val 
120 125 



Asp Pro Glu Pro 
130 



Tyr Gly Gin Arg Leu Phe Gly Phe Gin Pro Glu Pro 
135 140 



Phe Pro Met Ala Ser 
145 

<210> 24 
<211> 861 
<212> DNA 
<213> MOUSE 



<400> 24 
attcttttga 


gtcgggagaa 


ctaggtaaca 


attcggaaac 


tccaaagggt 


ggatgagggg 


60 


cgcgcggggt 


gtgtgtgggg 


gatactctgg 


tcccccgtgc 


agtgacctct 


aagtcagagg 


120 


ctggcacaca 


cacaccttcc 


attttttccc 


aaccgcagga 


tggcgcctca 


tcccttggat 


180 


gcgctcacca 


tccaagtgtc 


cccagagaca 


caacaacctt 


ttcccggagc 


ctcggaccac 


240 


gaagtgctca 


gttccaattc 


caccccacct 


agccccactc 


tcatacctag 


ggactgctcc 


300 


gaagcagaag 


tgggtgactg 


ccgagggacc 


tcgaggaagc 


tccgcgcccg 


acgcggaggg 


360 


cgcaacaggc 


ccaagagcga 


gttggcactc 


agcaaacagc 


gaagaagccg 


gcgcaagaag 


420 


gccaatgatc 


gggagcgcaa 


tcgcatgcac 


aacctcaact 


cggcgctgga 


tgcgctgcgc 


480 


ggtgtcctgc 


ccaccttccc 


ggatgacgcc 


aaacttacaa 


agatcgagac 


cctgcgcttc 


540 


gcccacaact 


acatctgggc 


actgactcag 


acgctgcgca 


tagcggacca 


cagcttctat 


600 


ggcccggagc 


cccctgtgcc 


ctgtggagag 


ctggggagcc 


ccggaggtgg 


ctccaacggg 


660 


gactggggct 


ctatctactc 


cccagtctcc 


caagcgggta 


acctgagccc 


cacggcctca 


720 


ttggaggaat 


tccctggcct 


gcaggtgccc 


agctccccat 


cctatctgct 


cccgggagca 


780 


ctggtgttct 


cagacttctt 


gtgaagagac 


ctgtctggct 


ctgggtggtg 


ggtgctagtg 


840 



gaaagggagg ggaccacagc c 861 



<210> 25 

<211> 214 

<212> PRT 

<213> MOUSE 



<400> 



25 



Met Ala Pro His Pro Leu Asp Ala Leu Thr lie Gin Val Ser Pro Glu 
15 10 15 



Thr Gin Gin Pro Phe Pro Gly Ala Ser Asp His Glu Val Leu Ser Ser 
20 25 30 

Asn Ser Thr Pro Pro Ser Pro Thr Leu lie Pro Arg Asp Cys Ser Glu 
35 40 45 

Ala Glu Val Gly Asp Cys Arg Gly Thr Ser Arg Lys Leu Arg Ala Arg 
50 55 60 

Arg Gly Gly Arg Asn Arg Pro Lys Ser Glu Leu Ala Leu Ser Lys Gin 
65 70 75 80 

Arg Arg Ser Arg Arg Lys Lys Ala Asn Asp Arg Glu Arg Asn Arg Met 
85 90 95 

His Asn Leu Asn Ser Ala Leu Asp Ala Leu Arg Gly Val Leu Pro Thr 
100 105 110 

Phe Pro Asp Asp Ala Lys Leu Thr Lys lie Glu Thr Leu Arg Phe Ala 
115 120 125 

His Asn Tyr lie Trp Ala Leu Thr Gin Thr Leu Arg He Ala Asp His 
130 135 140 

Ser Phe Tyr Gly Pro Glu Pro Pro Val Pro Cys Gly Glu Leu Gly Ser 
145 * 150 155 160 

Pro Gly Gly Gly Ser Asn Gly Asp Trp Gly Ser He Tyr Ser Pro Val 
165 170 175 

Ser Gin Ala Gly Asn Leu Ser Pro Thr Ala Ser Leu Glu Glu Phe Pro 
180 185 190 

Gly Leu Gin Val Pro Ser Ser Pro Ser Tyr Leu Leu Pro Gly Ala Leu 
195 200 205 

Val Phe Ser Asp Phe Leu 
210 

<210> 26 

<211> 72 

<212> DNA 

<213> MOUSE 

<400> 26 

atgcacgggc tgaatgcggc gctggacaac ccgagaaagg tggtaccttg ctactctaag 60 
acacagaagc tc 72 



<210> 27 

<211> 24 

<212> PRT 

<213> MOUSE 

<400> 27 

Met His Gly Leu Asn Ala Ala Leu Asp Asn Pro Arg Lys Val Val Pro 
15 10 15 



Cys Tyr Ser Lys Thr Gin Lys Leu 



20 



<210> 28 

<211> 66 

<212> DNA 

<213> MOUSE 

<400> 28 

atgcatggcc tgaatgatgc cttggataat cttagaagag tcatgccatg ttactctaaa 60 
actcaa 66 



<210> 29 

<211> 22 

<212> PRT 

<213> MOUSE 

<400> 29 

Met His Gly Leu Asn Asp Ala Leu Asp Asn Leu Arg Arg Val Met Pro 
15 10 15 

Cys Tyr Ser Lys Thr Gin 
20 

<210> 30 
<211> 1385 
<212> DNA 
<213> MOUSE 

<400> 30 



cttaggaagc 


gccaagcccg 


cggagcggag 


gacaccgtgc 


tcggttccgg 


gtgggggaca 


60 


ttcccggaca 


cacaccggag 


cagcagctgc 


gccggaacat 


tggagccgcg 


taggtaagtg 


120 


tgcatgccgc 


ggctttccat 


tcgcaggcag 


tgtccccacg 


caggctcacg 


ccgcccacgc 


180 


taactccatc 


gtttagacgc 


agtgacttct 


gtgaccggca 


gaaggtggct 


cgagcccggg 


240 


gcgctcctcc 


ccagctctgt 


cctcgccatc 


ttcgcgaatg 


cacattgagg 


gagatggagg 


300 


ggggggggcg 


gggcgcggcg 


ccagcgacac 


tttaccctgt 


ccattctggg 


aataaatttc 


360 


atctgcctct 


tctttctcag 


gatgttcgtc 


aaatctgaga 


ctctggagtt 


gaaggaggaa 


420 


gaggaggtac 


tgatgctgct 


gggctcggct 


tccccggcct 


cggcgaccct 


gaccccgatg 


480 


tcctccagcg 


cggacgagga 


ggaggacgag 


gagctgcgcc 


ggccgggctc 
* 


cgcgcgtggg 


540 


cagcgtggag 


cggaagccga 


gcagggggtg 


cagggcagtc 


cggcgtcggg 


tgccgggggt 


600 


tgccggccag 


ggcggctgct 


gggcctgatg 


cacgagtgca 


agcgtcgccc 


gtcgcgctca 


660 


cgggccgtct 


cccgaggtgc 


caagacggcg 


gagacggtgc 


agcgcatcaa 


gaagacccgc 


720 


aggctcaagg 


ccaacaaccg 


cgagcgcaac 


cgcatgcaca 


acctaaacgc 


cgcgctggac 


780 


gcgctgcgcg 


aggtgctgcc 


caccttcccc 


gaggatgcca 


agctcacgaa 


gatcgagacg 


840 


ctgcgcttcg 


cccacaatta 


catctgggcg 


ctcaccgaga 


ctctgcgcct 


ggcggaccac 


900 



tgcgccggcg ccggtggcct ccagggggcg ctcttcacgg aggcggtgct cctgagcccg 960 

ggagctgcgc tcggcgccag cggggacagc ccttctccac cttcctcctg gagctgcacc 1020 

aacagcccgg cgtcatcctc caactccacg tccccataca gctgcacttt atcgcccgct 1080 

agccccgggt cagacgtgga ctactggcag cccccacctc cggagaagca tcgttatgcg 1140 

cctcacctgc ccctcgccag ggactgtatc tagagctgcg ggtctccctc tctcgtctcc 1200 

tacccgggcc ctccttccca tccttctccc gccccccacc ctccacgccc cggaatccac 1260 

ttcacagaac agaagttggc cctttgcaat cccctccgcg gctggtgctt cgggggttgg 1320 

aaaacaactc tggtttattg aaattaagat tttggtcaaa aagaatatgc tttttggaat 1380 

tgggg 1385 



<210> 31 

<211> 263 

<212> PRT 

<213> MOUSE 

<400> 31 

Met Phe Val Lys Ser Glu Thr Leu Glu Leu Lys Glu Glu Glu Glu Val 
15 10 15 

Leu Met Leu Leu Gly Ser Ala Ser Pro Ala Ser Ala Thr Leu Thr Pro 
20 25 30 

Met Ser Ser Ser Ala Asp Glu Glu Glu Asp Glu Glu Leu Arg Arg Pro 
35 40 45 

Gly Ser Ala Arg Gly Gin Arg Gly Ala Glu Ala Glu Gin Gly Val Gin 
50 55 60 

Gly Ser Pro Ala Ser Gly Ala Gly Gly Cys Arg Pro Gly Arg Leu Leu 
65 70 75 80 

Gly Leu Met His Glu Cys Lys Arg Arg Pro Ser Arg Ser Arg Ala Val 
85 90 95 

Ser Arg Gly Ala Lys Thr Ala Glu Thr Val Gin Arg lie Lys Lys Thr 
100 105 110 

Arg Arg Leu Lys Ala Asn Asn Arg Glu Arg Asn Arg Met His Asn Leu 
115 120 125 

Asn Ala Ala Leu Asp Ala Leu Arg Glu Val Leu Pro Thr Phe Pro Glu 
130 135 140 

Asp Ala Lys Leu Thr Lys He Glu Thr Leu Arg Phe Ala His Asn Tyr 
145 150 155 160 

He Trp Ala Leu Thr Glu Thr Leu Arg Leu Ala Asp His Cys Ala Gly 
165 170 175 

Ala Gly Gly Leu Gin Gly Ala Leu Phe Thr Glu Ala Val Leu Leu Ser 



180 185 190 

Pro Gly Ala Ala Leu Gly Ala Ser Gly Asp Ser Pro Ser Pro Pro Ser 
195 200 205 

Ser Trp Ser Cys Thr Asn Ser Pro Ala Ser Ser Ser Asn Ser Thr Ser 
210 215 220 

Pro Tyr Ser Cys Thr Leu Ser Pro Ala Ser Pro Gly Ser Asp Val Asp 
225 230 235 240 

Tyr Trp Gin Pro Pro Pro Pro Glu Lys His Arg Tyr Ala Pro His Leu 
245 250 255 

Pro Leu Ala Arg Asp Cys lie 
260 

<210> 32 
<211> 3541 
<212> DNA 
<213> MOUSE 



<400> 32 
cagct gggca 


aggaaaatat 


gaaacacaag 


tgatctatat 


ccacagaata 


aacactaagt 


60 


t tatgtgtct 


ttt gacaagg 


tct catgtag 


ct cagactgg 


ggttaaactt 


tttatgctac 


120 


taaagataaa 


cttcaat at g 


ctaataaaga 


t aaactcctg 


atttttctgc 


t t ccacctct 


180 


l a u y y ' ■ Luy 


yuL^u'-u.'.jyi — 


a +* rrt" ctccr t~ c 




r*a crt" t1~ acrat" 


t t at accagg 


240 


tattttgtgg 


tataacattt 


ttatattttc 


ctttttccag 


aatctataca 


tggcaaaaat 


300 


gtatatgaaa 


tccaaggaca 


tggtggagct 


ggtcaacaca 


caatcctgga 


tggacaaagg 


360 


tctgagctct 


caaaatgaga 


tgaaggagca 


agagagaaga 


ccgggctctt 


atggaatgct 


420 


cggaacctta 


actgaagagc 


atgacagtat 


tgaggaggat 


gaagaagagg 


aagaagatgg 


480 


agataaacct 


aaaagaagag 


gtcccaagaa 


aaagaagatg 


actaaagctc 


gccttgaaag 


540 


attcagggct 


cgaagagtca 


aggccaatgc 


tagagaacgg 


acccggatgc 


atggcctgaa 


600 


tgatgccttg 


gataatctta 


ggagagtcat 


gccatgttac 


tctaaaactc 


aaaagctttc 


660 


caagatagag 


actcttcgac 


tggcaaggaa 


ctacatctgg 


gccttgtctg 


aagtcctgga 


720 


gactggtcag 


acacttgaag 


ggaagggatt 


tgtagagatg 


ctatgtaaag 


gtctctctca 


780 


acccacaagc 


aacctggttg 


ctggatgcct 


ccaactgggg 


cctcaatcta 


ccctcctgga 


840 


gaagcatgag 


gaaaaatctt 


caatttgtga 


ctctactatc 


tctgtccaca 


gcttcaacta 


900 


tcagtctcca 


gggctcccca 


gccctcctta 


tggccatatg 


gaaacacatt 


ctctccatct 


960 


caagcctcaa 


ccatttaaga 


gtttgggtga 


ctcttttggg 


agccatccac 


ctgactgcag 


1020 


t accccccct 


tatgagggtc 


cactcacacc 


acccctgagc 


attagtggca 


acttctcctt 


1080 


aaagcaagac 


ggctcccctg 


atttggaaaa 


atcctacaat 


ttcatgccac 


attatacctc 


1140 



tgcaagtcta 


agttcagggc 


atgtgcattc 


aactcccttt 


cagactggca 


ctccccgcta 


1200 


tgatgttcct 


gtagacctga 


gctatgattc 


ctactcccac 


catagcattg 


gaactcagct 


1260 


caatacgatc 


ttctctgatt 


agagcaataa 


gataagcacc 


aatatttcag 


agaatgaagt 


1320 


ggagattttt 


ttcacatttc 


tagtggctga 


gctaaactct 


cagaaaattt 


aaaagaacct 


1380 


t tggatatgc 


atcaaacata 


atagtcctag 


tttgttcaga 


acttcctgta 


cctgctaact 


1440 


t tcttcccat 


taacttctca 


cattggacca 


gtcctacatt 


tggtaaactt 


aagtgaatat 


1500 


atttgatggt 


ttgaggccac 


atggtaatag 


aacagaaaga 


aagcccaggc 


cctgttccaa 


1560 


t ggtgccaaa 


gattaattga 


atgctctgcc 


aattaacttt 


ccatttccag 


tgtttttatt 


1620 


gctttctgat 


aaacatgaag 


caactgttcc 


aaatcaacat 


atacctttca 


cctcccacac 


1680 


atttttaaat 


ttaaaagcaa 


tcaaagcaaa 


tagcaaaaat 


ggaatgatta 


tacagaactg 


1740 


gaaggagcat 


caagtacatg 


tctgttggct 


tatagaatac 


aaaatttgtg 


tgaatttgac 


1800 


aaatcatctt 


tgtgtctatt 


taaatataac 


ttccagagcc 


taaaataata 


acttggatgt 


1860 


taccataaat 


aaccagtatg 


ttctttaaga 


gatcagctct 


acttattact 


gtgctgaaag 


1920 


gtatacacac 


cttttttagt 


gattggagaa 


ccatgataga 


agcctcacac 


aaactttatt 


1980 


ctttatacta 


tttaaaaaac 


aactgtctta 


gtactaggag 


acaagcaaga 


aagacattga 


2040 


aatttctctt 


ctggcacaca 


gaaatatcac 


ctagctcatt 


tcttagctcc 


cgtgactata 


2100 


gggctgtaga 


attttgcagg 


tattcaggtg 


cttcagttag 


aatcagaact 


cagccaggtt 


2160 


catactgtag 


agcaataaaa 


tggtggttgg 


ctgctatccc 


aactaacaca 


gttaaagaac 


2220 


t ctgcctcac 


acatagccac 


tggaaaaatg 


tggatattct 


ccagcaagat 


gaatttcact 


2280 


gtttaaagca 


atgcaattaa 


agccatagag 


tttcgtccac 


tccagtatca 


tatattccag 


2340 


aactgttgta 


atcaatcttg 


aattcttaca 


acataaatgc 


aactccttac 


ttcccaccta 


2400 


acactgattg 


ttatattgtc 


ttcaattcca 


agatattatg 


caattatatg 


caaaattttg 


2460 


attagaatca 


aaattaagag 


tcaatgaatc 


tgtctgtatc 


ttcaggacgg 


gttttgatca 


2520 


gttttaagaa 


agtttatttt 


cctttatgtg 


gcatctcttt 


tctttgtaac 


cacactggtt 


2580 


cagccaagtt 


tctcttctcc 


agagaaatta 


gctctgagaa 


attttactat 


catgatccat 


2640 


cttccacagc 


aattatttag 


gttcaactca 


agagtataca 


tagtttattt 


atagtgggtg 


2700 


aggatacacc 


tccaagaata 


aattttaaca 


acattaatga 


catatgaata 


tgccatttta 


2760 


t ctaccaaac 


tat atatgta 


tct cttttct 


tt attgccct 


tatttatttc 


tt cacactga 


o o o n 
A d Z U 


gagttatttt 


gtgtccatct 


tattgcagca 


cttactctgc 


tctactttgc 


acctttggat 


2880 


tataaatatg 


tttaaaagtc 


tgtaaagacg 


tcttaaacaa 


ctcgtgacag 


taattcacca 


2940 


cccctaagac 


cttgaatcac 


cctagtggaa 


ataggcaagg 


agaattattt 


atagaatcat 


3000 



cctatgtaat 


tttttttgag 


aatttgctct 


acctagcatt 


tatgtttata 


gataattgct 


3060 


atctgcatta 


tttattaggt 


tctatttatt 


taatttatct 


ttctttcttt 


ttatgtaaac 


3120 


atttgtgccc 


catagatata 


gcctcaaagc 


ttcactggga 


aactagctta 


tatgtttgga 


3180 


gtgagagaaa 


aggagaaaat 


cagttcttga 


ttgcttgcaa 


tggttttata 


aaacagagca 


3240 


a taatttgaa 


tagatatgca 


acttaatggt 


tttagaattt 


ttcctttaag 


gtgcaacaga 


3300 


gttacattat 


tatttatgac 


tttggagaat 


gtagtacatg 


tgaaccagga 


ctgtaggctt 


3360 


gtgaagagag 


attttataat 


taaatacaaa 


tttagtactg 


tactatgctt 


ggaaagaact 


3420 


tgttctttaa 


ataatgttta 


gtcttctggg 


agtgttttca 


gataaaatga 


agcaattgtt 


3480 


taaaagaaat 


ggctgttttc 


cctccctttt 


ccagtagcaa 


taaagctttg 


agtgttatta 


3540 


c 












3541 



<210> 33 

<211> 330 

<212> PRT 

<213> MOUSE 

<400> 33 

Met Ala Lys Met Tyr 
1 5 

Thr Gin Ser Trp Met 
20 

Glu Gin Glu Arg Arg 
35 

Glu Glu His Asp Ser 
50 

Asp Lys Pro Lys Arg 
65 

Arg Leu Glu Arg Phe 
85 

Arg Thr Arg Met His 
100 

Val Met Pro Cys Tyr 
115 

Leu Arg Leu Ala Arg 
130 

Thr Gly Gin Thr Leu 
145 



Met Lys Ser Lys Asp 
10 

Asp Lys Gly Leu Ser 
25 

Pro Gly Ser Tyr Gly 
40 

lie Glu Glu Asp Glu 
55 

Arg Gly Pro Lys Lys 
70 

Arg Ala Arg Arg Val 
90 

Gly Leu Asn Asp Ala 
105 

Ser Lys Thr Gin Lys 
120 

Asn Tyr lie Trp Ala 

135 

Glu Gly Lys Gly Phe 
150 



Met Val Glu Leu Val Asn 
15 

Ser Gin Asn Glu Met Lys 
30 

Met Leu Gly Thr Leu Thr 
45 

Glu Glu Glu Glu Asp Gly 
60 

Lys Lys Met Thr Lys Ala 
75 80 

Lys Ala Asn Ala Arg Glu 

95 

Leu Asp Asn Leu Arg Arg 
110 

Leu Ser Lys lie Glu Thr 
125 

Leu Ser Glu Val Leu Glu 
140 

Val Glu Met Leu Cys Lys 
155 160 



Gly Leu Ser Gin Pro Thr Ser Asn Leu Val Ala Gly Cys Leu Gin Leu 
165 170 175 



Gly Pro Gin Ser Thr Leu 
180 

Cys Asp Ser Thr lie Ser 
195 

Leu Pro Ser Pro Pro Tyr 
210 

Lys Pro Gin Pro Phe Lys 
225 230 

Pro Asp Cys Ser Thr Pro 
245 

Ser lie Ser Gly Asn Phe 
260 

Glu Lys Ser Tyr Asn Phe 
275 

Ser Gly His Val His Ser 
290 

Asp Val Pro Val Asp Leu 
305 310 

Gly Thr Gin Leu Asn Thr 
325 

<210> 34 
<211> 800 
<212> DNA 
<213> MOUSE 

<400> 34 



agggtagtag 


agtcactaat 


actacgtcct 


tgctgtgtag 


ttttctcagg 


ttttgattca 


60 


ttttaagcat 


gggccaccaa 


tctcatgtac 


ctttttagct 


ttctttccaa 


gtgtgtctcc 


120 


agttttctga 


cccccctcct 


gttttgcctg 


ctctaccctg 


ccttgtacct 


ttctaatcac 


180 


aagtcttttc 


agttccctta 


gttttcaccc 


atcaacttca 


gcagcccaca 


ccctctagtt 


240 


ccttcctggt 


ttaaacaaaa 


acaaacacgc 


agtggcaaag 


ctggacctgg 


tcagagaagc 


300 


cttgtgaagg 


aggtgtgtct 


ttaggctagg 


aaggggaggg 


gctaccctgt 


gggcaacatc 


360 


tcccgccctg 


gtcagcagcc 


aaaaccagca 


aaacggcggc 


aagtcagaag 


ctccagtcag 


420 


atcacaggag 


ctgcccagag 


actgtggtac 


tgaaagaact 


actcgcggga 


gctgaccccg 


480 


ggaaagaggt 


actgaaaaga 


catagaaaac 


cagctgtggt 


ggaggcactg 


acatgaaggc 


540 


atcctggtag 


tgcatcagaa 


ctccaggaaa 


aagtagaata 


agtaacagcc 


aaggtactca 


600 


gaccaggaac 


agtcactaga 


aggtagctac 


cagtttaaca 


tggacgactg 


aaagggtctt 


660 


ctgtttccca 


cgatctgcct 


ggtcaggtca 


gggtagaact 


gactgctctg 


atagttcttc 


720 



Leu Glu 

Val His 
200 

Gly His 
215 

Ser Leu 
Pro Tyr 



Ser Leu 

Met Pro 
280 

Thr Pro 
295 

Ser Tyr 
He Phe 



Lys His 
185 

Ser Phe 

Met Glu 

Gly Asp 

Glu Gly 
250 

Lys Gin 
265 

His Tyr 

Phe Gin 

Asp Ser 

Ser Asp 
330 



Glu Glu 

Asn Tyr 

Thr His 
220 

Ser Phe 
235 

Pro Leu 

Asp Gly 

Thr Ser 

Thr Gly 
300 

Tyr Ser 
315 



Lys Ser 
190 

Gin Ser 
205 

Ser Leu 

Gly Ser 

Thr Pro 

Ser Pro 
270 

Ala Ser 
285 

Thr Pro 
His His 



Ser He 

Pro Gly 

His Leu 

His Pro 
240 

Pro Leu 
255 

Asp Leu 

Leu Ser 

Arg Tyr 

Ser He 
320 



aggacacaga 


ttagagttta 


atcttggaac 


tggacttcca 


gaggtgagcc 


tgtgaacggg 


780 


gtgtgggtac 


taaagtttct 










800 


<210> 35 
<211> 515 
<212> DNA 
<213> CHICKEN 












<400> 35 
tttttctgta 


tgcgtgtgaa 


gctgcatctg 


caatagatat 


gaaaacctgt 


caatccagtc 


60 


atttggattc 


aggagtagaa 


tcagacatcc 


agtgcagaag 


tggatcaggc 


tgtgttgtga 


120 


agtgcagcac 


agaaagaatg 


gagagcgctg 


ccaagagaag 


actggctgcc 


aacgccaggg 


180 


agagaagacg 


gatgcaagga 


ctgaacacag 


cctttgatcg 


tttgaggaag 


gtggttccac 


240 


agtggggtca 


agataagaag 


ctctccaaat 


atgagaccct 


tcagatggct 


ttgagttata 


300 


tcatggctct 


aacacgaata 


cttgctgaag 


cagagagata 


cagtactgaa 


agagaat gga 


360 


ttaaccttca 


ctgtgaacac 


tttcatccag 


agagctacca 


ccattatacg 


ggacaaaaag 


420 


tggcaacaga 


cagtgatcct 


tatgcacagc 


gaatattcag 


ctatcaccct 


gaacactttc 


480 


aaatagctaa 


ttagaactta 


ttacgagcta 


aaaaa 






515 



<210> 36 

<211> 151 

<212> PRT 

<213> CHICKEN 

<400> 36 

Met Lys Thr Cys Gin 
1 5 

lie Gin Cys Arg Ser 
20 

Arg Met Glu Ser Ala 
35 

Arg Arg Arg Met Gin 
50 

Val Val Pro Gin Trp 
65 

Leu Gin Met Ala Leu 
85 

Glu Ala Glu Arg Tyr 
100 

Glu His Phe His Pro 
115 



Ser Ser His Leu Asp 
10 

Gly Ser Gly Cys Val 
25 

Ala Lys Arg Arg Leu 
40 

Gly Leu Asn Thr Ala 
55 

Gly Gin Asp Lys Lys 
70 

Ser Tyr lie Met Ala 
90 

Ser Thr Glu Arg Glu 
105 

Glu Ser Tyr His His 
120 



Ser Gly Val Glu Ser Asp 
15 

Val Lys Cys Ser Thr Glu 
30 

Ala Ala Asn Ala Arg Glu 
45 

Phe Asp Arg Leu Arg Lys 
60 

Leu Ser Lys Tyr Glu Thr 
75 80 

Leu Thr Arg lie Leu Ala 

95 

Trp lie Asn Leu His Cys 
110 

Tyr Thr Gly Gin Lys Val 
125 



Ala Thr Asp Ser Asp Pro Tyr Ala Gin Arg lie Phe Ser Tyr His Pro 
130 135 140 

Glu His Phe Gin lie Ala Asn 
145 150 

<210> 37 

<211> 1412 

<212> DNA 

<213> MOUSE 

<400> 37 



cgtgctcggt 


tccgggctgc 


ggggacattc 


ccggacacac 


accggagcag 


cagctgcgcc 


60 


gcgacacatc 


tggagccgcg 


taggatgttc 


gtcaaatctg 


agactctgga 


gttgaaggag 


120 


gaagaggagg 


tactgatgct 


gctgggctcg 


gcttccccgg 


cctcggcgac 


cctgaccccg 


180 


atgtcctcca 


gcgcggacga 


ggaggaggac 


gaggagctgc 


gccggccggg 


ctccgcgcgt 


240 


gggcagcgtg 


gagcggaagc 


cgggcagggg 


gtgcagggca 


gtccggcgtc 


gggtgccggg 


300 


ggttgccggc 


cagggcggct 


gctgggcctg 


atgcacgagt 


gcaagcgtcg 


cccgtcgcgc 


360 


tcacgggccg 


tctcccgagg 


tgccaagacg 


gcggagacgg 


tgcagcgcat 


caagaagacc 


420 


cgcaggctca 


aggccaacaa 


ccgcgagcgc 


aaccgcatgc 


acaacctaaa 


cgccgcgctg 


480 


gacgcgctgc 


gcgaggtgct 


gcccaccttc 


cccgaggatg 


ccaagctcac 


gaagatcgag 


540 


acgctgcgct 


tcgcccacaa 


ttacatctgg 


gcgctcaccg 


agactctgcg 


cctggcggac 


600 


cactgcgccg 


gcgccggtgg 


cctccagggg 


gcgctcttca 


cggaggcggt 


gctcctgagc 


660 


ccgggagctg 


cgctcggcgc 


cagcggggac 


agcccttctc 


caccttcctc 


ctggagctgc 


720 


accaacagcc 


cggcgtcatc 


ctccaactcc 


acgtccccat 


acagctgcac 


tttatcgccc 


780 


gctagccccg 


ggtcagacgt 


ggactactgg 


cagcccccac 


ctccggagaa 


gcatcgttat 


840 


gcgcct cacc 




cagggacugu 


a l. c uagagc l 


y L-y y y lllll 


LLLLCLLy LL 


q n o 

z> U \J 


ctctacccgg 


ccctcttccc 


atccttctcc 


cgcccctcac 


cctccacgcc 


ccggactcca 


960 


cttcacagag 


cagaggtggc 


ccttgcaatc 


ccctcggcgg 


ctggtgcatt 


cgggggtgga 


1020 


gaccagctct 


ggtttattga 


agatgtgagg 


atttatggtc 


aaagaggact 


atggcgtgtg 


1080 


ggagtggggg 


ctggcgtggg 


gaacctcgta 


agactgtaaa 


agacactgag 


aaaaagtacc 


1140 


ataactaacg 


agtgtgcaga 


gcagactgac 


gctcctcccc 


tctctcagag 


ctgctggagg 


1200 


agaactccgg 


gcaggcagtt 


cgtgtgaatc 


tctcagaggg 


aatgcaactg 


gtccctgtga 


1260 


tcttttcacc 


ttcgtttcta 


catagagatg 


ttaatgtcag 


tcgaaagaaa 


tgtattttag 


1320 


catctgaatg 


aatttactgg 


taataatatt 


atccacacat 


ttgcaatggc 


tggcatctgc 


1380 


tctattccca 


ttgctgtctg 


caggctgtgg 


ga 






1412 



<210> 38 

<211> 263 

<212> PRT 

<213> MOUSE 

<400> 38 

Met Phe Val Lys 
1 

Leu Met Leu Leu 
20 

Met Ser Ser Ser 
35 

Gly Ser Ala Arg 
50 

Gly Ser Pro Ala 
65 

Gly Leu Met His 



Ser Arg Gly Ala 
100 

Arg Arg Leu Lys 
115 

Asn Ala Ala Leu 
130 

Asp Ala Lys Leu 
145 

lie Trp Ala Leu 



Ala Gly Gly Leu 
180 

Pro Gly Ala Ala 
195 

Ser Trp Ser Cys 
210 

Pro Tyr Ser Cys 
225 

Tyr Trp Gin Pro 



Pro Leu Ala Arg 
260 

<210> 39 
<211> 938 
<212> DNA 



Ser Glu 
5 

Gly Ser 

Ala Asp 

Gly Gin 

Ser Gly 
70 

Glu Cys 
85 

Lys Thr 
Ala Asn 



Asp Ala 



Thr Lys 
150 

Thr Glu 
165 

Gin Gly 

Leu Gly 

Thr Asn 

Thr Leu 
230 

Pro Pro 
245 

Asp Cys 



Thr Leu 



Ala Ser 

Glu Glu 
40 

Arg Gly 
55 

Ala Gly 

Lys Arg 

Ala Glu 

Asn Arg 
120 

Leu Arg 
135 

He Glu 

Thr Leu 

Ala Leu 

Ala Ser 
200 

Ser Pro 
215 

Ser Pro 
Pro Glu 
He 



Glu Leu 

10 

Pro Ala 
25 

Glu Asp 

Ala Glu 

Gly Cys 

Arg Pro 
90 

Thr Val 
105 

Glu Arg 

Glu Val 

Thr Leu 

Arg Leu 
170 

Phe Thr 
185 

Gly Asp 

Ala Ser 

Ala Ser 

Lys His 
250 



Lys Glu 

Ser Ala 

Glu Glu 

Ala Gly 
60 

Arg Pro 
75 

Ser Arg 

Gin Arg 

Asn Arg 

Leu Pro 
140 

Arg Phe 
155 

Ala Asp 

Glu Ala 

Ser Pro 

Ser Ser 
220 

Pro Gly 
235 

Arg Tyr 



Glu Glu 

Thr Leu 
30 

Leu Arg 
45 

Gin Gly 

Gly Arg 

Ser Arg 

He Lys 
110 

Met His 
125 

Thr Phe 

Ala His 

His Cys 

Val Leu 
190 

Ser Pro 
205 

Asn Ser 



Ser Asp 



Ala Pro 



Glu Val 
15 

Thr Pro 
Arg Pro 
Val Gin 

Leu Leu 

80 

Ala Val 
95 

Lys Thr 



Asn Leu 



Pro Glu 



Asn Tyr 
160 

Ala Gly 
175 

Leu Ser 

Pro Ser 

Thr Ser 

Val Asp 
240 

His Leu 
255 



<213> ZEBRA FISH 
<400> 39 



cttatagggc 


tcgagcggcc 


gcccgggcag 


gtcaggtgta 


agaagagaaa 


tcttcccagg 


60 


caaaatatcc 


gtccctgtat 


ccatagccac 


aaactttcct 


cccaaaagca 


caaaccaaca 


120 


gaatggatgg 


aatgagcacg 


gatacaagag 


aggtggttga 


actcgacgtc 


cagcattcga 


180 


gcttggggcg 


gggggagcag 


agcaagtacc 


caccagcctt 


ggcactcatg 


gccagcagtg 


240 


acccacgcgc 


ctggctggct 


cccgtgcagg 


ctggcacctg 


cgcggcacac 


gccgaatacc 


300 


tgctgcactc 


gcccggctcg 


agcgcggaag 


gcgtgtcctc 


tgcctccaac 


ttcaggaaga 


360 


gcagcaagag 


tcctgtcaaa 


gtacgcgagc 


tctgccggct 


taaaggagct 


gtgggggcag 


420 


atgagggcag 


acagcgggcc 


ccatccagca 


aatccaccaa 


cgtcgtgcag 


aaacagaggc 


480 


gaatggctgc 


caatgcccgg 


gagaggcgaa 


gaatgcacgg 


attgaaccac 


gcgttcgacg 


540 


agctgcgcag 


tgtcatccca 


gcctttgaca 


acgacaagaa 


actctccaag 


tacgaaaccc 


600 


t gcagatggc 


ccagatctac 


atcaacgccc 


tgtccgactt 


actacagggc 


cccggtgcta 


660 


aagccgaccc 


gccaaactgc 


gacctgctgc 


atgccaacgt 


gttagaaacg 


gaccgatctc 


720 


ccagaggatc 


accgggcgtc 


tgtcggagag 


gcacgggcgt 


gggttacccg 


taccagtacg 


780 


aggacggaac 


attcaactct 


ttcatggagc 


aagacctcca 


gtcgccctct 


ggaacgagca 


840 


agtctggttc 


ggaggccagt 


aaagactcgc 


ctcggtcgaa 


ccggagtgat 


ggagaagttc 


900 


tcgcctcact 


gaagtgcgag 


tgagacctgc 


ccgggcgg 






938 



<210> 40 

<211> 266 

<212> PRT 

<213> ZEBRA FISH 

<400> 40 

Met Asp Gly Met Ser Thr Asp Thr Arg Glu Val Val Glu Leu Asp Val 
15 10 15 

Gin His Ser Ser Leu Gly Arg Gly Glu Gin Ser Lys Tyr Pro Pro Ala 
20 25 30 

Leu Ala Leu Met Ala Ser Ser Asp Pro Arg Ala Trp Leu Ala Pro Val 
35^ 40 45 

Gin Ala Gly Thr Cys Ala Ala His Ala Glu Tyr Leu Leu His Ser Pro 
50 55 60 

Gly Ser Ser Ala Glu Gly Val Ser Ser Ala Ser Asn Phe Arg Lys Ser 
65 70 75 80 

Ser Lys Ser Pro Val Lys Val Arg Glu Leu Cys Arg Leu Lys Gly Ala 
85 90 95 



Val Gly Ala Asp Glu Gly Arg Gin Arg Ala Pro Ser Ser Lys Ser Thr 
100 105 HO 

Asn Val Val Gin Lys Gin Arg Arg Met Ala Ala Asn Ala Arg Glu Arg 
115 120 125 

Arg Arg Met His Gly Leu Asn His Ala Phe Asp Glu Leu Arg Ser Val 
1.30 135 140 

He Pro Ala Phe Asp Asn Asp Lys Lys Leu Ser Lys Tyr Glu Thr Leu 
145 150 155 160 

Gin Met Ala Gin He Tyr He Asn Ala Leu Ser Asp Leu Leu Gin Gly 
165 170 175 

Pro Gly Ala Lys Ala Asp Pro Pro Asn Cys Asp Leu Leu His Ala Asn 
180 185 190 

Val Leu Glu Thr Asp Arg Ser Pro Arg Gly Ser Pro Gly Val Cys Arg 
195 200 205 

Arg Gly Thr Gly Val Gly Tyr Pro Tyr Gin Tyr Glu Asp Gly Thr Phe 
210 215 220 

Asn Ser Phe Met Glu Gin Asp Leu Gin Ser Pro Ser Gly Thr Ser Lys 
225 230 235 240 

Ser Gly Ser Glu Ala Ser Lys Asp Ser Pro Arg Ser Asn Arg Ser Asp 
245 250 255 

Gly Glu Val Leu Ala Ser Leu Lys Cys Glu 
260 265 

<210> 41 
<211> 948 
<212> DNA 
<213> FROG 

<400> 41 

atgtcagaga tggtcaatgt gcatgggtgg atggaggaag cccttagttc ccaggatgag 
atgaaggaga ggaatcagtc tgcctatgat atcatttcag gtctatgcca tgaggaaagg 
ggcagcattg atggagaaga ggatgatgaa gaagaagagg atggagagaa accaaaaaag 
aggggaccca aaaaaaagaa gatgaccaag gctagagtgg agaggttccg tgtccgtaga 
gtaaaagcca atgccaggga gcgttcaaga atgcatggac ttaatgatgc cctggaaaat 
ttgagaaggg ttatgccttg ctattccaaa acacaaaagt tgtctaaaat tgagactctt 
agactggcca gaaactatat atgggcatta tctgatattc tagaacaagg tcaaaatgca 
gagggaaagg gctttctgga aatactctgc aaaggtcttt ctcagccaac aagcaactta 
gtagctggct gcttgcaact tggacctcag gccatgttct tggataaaca cgaagaaaag 
tctcatatat gtgattcctc tcttactggt catacttata attaccagtc cccaggacta 
cccagtcctc cttatggtaa cattgatgtt caccacttgc acttgaaacc ctcttctttc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



aaaccagtaa tggatccttc tgtggtaacc catacactta actgtaccac tccaccatat 720 

gaaggagctc taacacctcc actcagcatc ggtggtaatt tttctttgaa gcaagatagt 780 

tcacccgata tggataaatc atatgcattc aggtccccct atccagctct tgggcttggt 840 

ggatctcatg gacatgcgtc acactttcat accagtgttc caaggtatga actacccata 900 

gacatggctt acgagcctta cccacaccat gctatattca ctgaataa 948 



<210> 42 

<211> 315 

<212> PRT 

<213> FROG 

<400> 42 

Met Ser Glu Met Val Asn Val His Gly Trp Met Glu Glu Ala Leu Ser 
15 10 15 

Ser Gin Asp Glu Met Lys Glu Arg Asn Gin Ser Ala Tyr Asp lie lie 
20 25 30 

Ser Gly Leu Cys His Glu Glu Arg Gly Ser lie Asp Gly Glu Glu Asp 
35 40 45 

Asp Glu Glu Glu Glu Asp Gly Glu Lys Pro Lys Lys Arg Gly Pro Lys 
50 55 60 

Lys Lys Lys Met Thr Lys Ala Arg Val Glu Arg Phe Arg Val Arg Arg 
65 70 75 80 

Val Lys Ala Asn Ala Arg Glu Arg Ser Arg Met His Gly Leu Asn Asp 
85 90 95 

Ala Leu Glu Asn Leu Arg Arg Val Met Pro Cys Tyr Ser Lys Thr Gin 
100 105 110 

Lys Leu Ser Lys lie Glu Thr Leu Arg Leu Ala Arg Asn Tyr lie Trp 
115 120 125 

Ala Leu Ser Asp lie Leu Glu Gin Gly Gin Asn Ala Glu Gly Lys Gly 
130 135 140 

Phe Leu Glu lie Leu Cys Lys Gly Leu Ser Gin Pro Thr Ser Asn Leu 
145 150 155 160 

Val Ala Gly Cys Leu Gin Leu Gly Pro Gin Ala Met Phe Leu Asp Lys 
165 170 175 

His Glu Glu Lys Ser His lie Cys Asp Ser Ser Leu Thr Gly His Thr 
180 185 190 

Tyr Asn Tyr Gin Ser Pro Gly Leu Pro Ser Pro Pro Tyr Gly Asn lie 
195 200 205 

Asp Val His His Leu His Leu Lys Pro Ser Ser Phe Lys Pro Val Met 
210 215 220 

Asp Pro Ser Val Val Thr His Thr Leu Asn Cys Thr Thr Pro Pro Tyr 



225 



230 235 240 



Glu Gly Ala Leu Thr Pro Pro Leu Ser lie Gly Gly Asn Phe Ser Leu 
245 250 255 

Lys Gin Asp Ser Ser Pro Asp Met Asp Lys Ser Tyr Ala Phe Arg Ser 
260 265 270 

Pro Tyr Pro Ala Leu Gly Leu Gly Gly Ser His Gly His Ala Ser His 
275 280 285 

Phe His Thr Ser Val Pro Arg Tyr Glu Leu Pro He Asp Met Ala Tyr 
290 295 300 

Glu Pro Tyr Pro His His Ala He Phe Thr Glu 
305 310 315 

<210> 43 

<211> 1550 

<212> DNA 

<213> MOUSE 

<400> 43 

tttaaatact gtgctggcat tttttttcag gtaaattaag gttaaagaac catgttaaca bO 
ctaccgtttg acgagtctgt cgtaatgccc gaatcccaga tgtgcagaaa gtttgctaga 120 
caatgtgagg accagaaaca aattaagaaa ccagagagct ttccaaaaca agttgtcctt 
cgaggaaaga gcattaaaag ggcccctgga gaagaaaccg agaaagaaga ggaggaagaa 
gacagagagg aagaagatga gaatggcttg tccagaagga gggggctcag gaaaaaaaag 
accaccaaac tacgactgga aagggtcaag ttcaggagac aggaagctaa tgcgcgcgag 
aggaaccgga tgcacggcct caatgatgct ctggacaatt tgcgaaaagt ggtcccctgt 420 
tactctaaaa cccaaaaact gtccaaaata gaaactttac gactggccaa aaattacatc 480 
tgggcacttt ctgaaattct gaggattggc aagagaccgg atctgctcac gttcgtccaa 540 
aacttatgca aaggtctttc ccagccaact acaaacttgg tggcaggctg cttacagctc 600 
aacgccagaa gtttcctgat gggtcagggt ggggaggctg cccaccacac aaggtcaccc 
tactccacat tctacccacc ctaccacagc cctgagctgg ccactccccc agggcatggg 
actcttgata attccaagtc catgaaaccc tacaattact gcagtgcata tgaatccttc 
tatgaaagta cctcccctga gtgtgccagc cctcagtttg aaggtccctt aagtcctccc 
ccaattaact ataatgggat attttccctg aagcaagaag aaaccttgga ctatggcaaa 
aattacaatt atggcatgca ttactgtgca gtgccaccca ggggtcccct tgggcagggt 
gccatgttca ggttgcccac cgacagccac ttcccttacg acttacatct gcgcagccaa 
tctctcacta tgcaagatga attaaatgca gtttttCata attaatgagg aaaattaaaa 
taaacagtgg tcattcacct cccactctaa ttaaggcaaa gcagatgctt gtgggctgag 1140 



180 
240 
300 
360 



660 
720 
780 
840 
900 
960 
1020 
1080 



taattggcac aactctatct aaggtgttta ctagtttctg aagtgtgttt caaagattgt 1200 

gaccattttc tatgtcataa taaatccctt ttcgtatgag aacttccttt ccttccctct 1260 

tgtctgtatc acactgtgat tctctctctc tctctctctc tctctctctc tctctctctc 1320 

tcttactggc agaatatttc tttcttgttt tagtttcttt caaattcact taatttgttt 1380 

gaacaaggtg tctaagatgt tgctgaataa agacatgcac acagcatact tcaatgtcta 1440 

tttcagttgt acagttatga tgaaaatgca tgttataaaa atcagatgag taaaatgtgt 1500 

ttataattac taggattcat atatgtatct ctgaaatttt agtttttaaa 1550 



<210> 44 

<211> 337 

<212> PRT 

<213> MOUSE 

<400> 44 

Met Leu Thr Leu Pro Phe Asp Glu Ser Val Val Met Pro Glu Ser Gin 
15 10 15 

Met Cys Arg Lys Phe Ala Arg Gin Cys Glu Asp Gin Lys Gin lie Lys 
20 25 30 

Lys Pro Glu Ser Phe Pro Lys Gin Val Val Leu Arg Gly Lys Ser lie 
35 40 45 

Lys Arg Ala Pro Gly Glu Glu Thr Glu Lys Glu Glu Glu Glu Glu Asp 
50 55 60 

Arg Glu Glu Glu Asp Glu Asn Gly Leu Ser Arg Arg Arg Gly Leu Arg 
65 70 75 80 

Lys Lys Lys Thr Thr Lys Leu Arg Leu Glu Arg Val Lys Phe Arg Arg 
85 90 95 

Gin Glu Ala Asn Ala Arg Glu Arg Asn Arg Met His Gly Leu Asn Asp 
100 105 110 

Ala Leu Asp Asn Leu Arg Lys Val Val Pro Cys Tyr Ser Lys Thr Gin 
115 120 125 

Lys Leu Ser Lys lie Glu Thr Leu Arg Leu Ala Lys Asn Tyr lie Trp 
130 135 140 

Ala Leu Ser Glu lie Leu Arg lie Gly Lys Arg Pro Asp Leu Leu Thr 
145 150 155 160 

Phe Val Gin Asn Leu Cys Lys Gly Leu Ser Gin Pro Thr Thr Asn Leu 
165 170 175 

Val Ala Gly Cys Leu Gin Leu Asn Ala Arg Ser Phe Leu Met Gly Gin 
180 185 190 

Gly Gly Glu Ala Ala His His Thr Arg Ser Pro Tyr Ser Thr Phe Tyr 
195 200 205 



Pro Pro Tyr His Ser 
210 

Leu Asp Asn Ser Lys 
225 

Glu Ser Phe Tyr Glu 
245 

Glu Gly Pro Leu Ser 
260 

Leu Lys Gin Glu Glu 
275 

Met His Tyr Cys Ala 
290 

Met Phe Arg Leu Pro 
305 

Arg Ser Gin Ser Leu 
325 

Asn 

<210> 45 
<211> 1393 
<212> DNA 
<213> MOUSE 

<400> 45 



aagcttcgtt 


gcacgcgacc 


tggtgtgcga 


tctccgagtg 


agagggggag 


ggtcagagga 


60 


ggaaggaaaa 


aaaatcagac 


cttgcagaag 


agactaggaa 


ggtttttgtt 


gttgttgttc 


120 


ggggcttatc 


cccttcgttg 


aactgggttg 


ccagcacctc 


ctctaacacg 


gcacctccga 


180 


gccattgcag 


tgcgatgtcc 


cgcctgctgc 


atgcagaaga 


gtgggctgag 


gtaaaagagt 


240 


tgggggacca 


ccatcgccat 


ccccagccgc 


accacgtccc 


gccgctgacg 


ccacagccac 


300 


ctgctaccct 


gcaggcgaga 


gaccttcccg 


tctacccggc 


agaactgtcc 


ctcctggata 


360 


gcaccgaccc 


acgcgcctgg 


ctgactccca 


ctttgcaggg 


cctctgcacg 


gcacgcgccg 


420 


cccagtatct 


gctgcattct 


cccgagctgg 


gtgcctccga 


ggccgcggcg 


ccccgggacg 


480 


aggctgacag 


ccagggtgag 


ctggtaagga 


gaagcggctg 


tggcggcctc 


agcaagagcc 


540 


ccgggcccgt 


caaagtacgg 


gaacagctgt 


gcaagctgaa 


gggtggggtt 


gtagtggacg 


600 


agcttggctg 


cagccgccag 


cgagcccctt 


ccagcaaaca 


ggtgaatggg 


gtacagaagc 


660 


aaaggaggct 


ggcagcaaac 


gcaagggaac 


ggcgcaggat 


gcacgggctg 


aaccacgcct 


720 


tcgaccagct 


gcgcaacgtt 


atcccgtcct 


tcaacaacga 


caagaagctg 


tccaaatatg 


780 


agaccctaca 


gatggcccag 


atctacatca 


acgctctgtc 


ggagttgctg 


cagactccca 


840 



Pro Glu Leu Ala Thr 
215 

Ser Met Lys Pro Tyr 
230 

Ser Thr Ser Pro Glu 
250 

Pro Pro Pro lie Asn 
265 

Thr Leu Asp Tyr Gly 
280 

Val Pro Pro Arg Gly 
295 

Thr Asp Ser His Phe 
310 

Thr Met Gin Asp Glu 
330 



Pro Pro Gly His Gly Thr 
220 

Asn Tyr Cys Ser Ala Tyr 
235 240 

Cys Ala Ser Pro Gin Phe 
255 

Tyr Asn Gly lie Phe Ser 
270 

Lys Asn Tyr Asn Tyr Gly 
285 

Pro Leu Gly Gin Gly Ala 
300 

Pro Tyr Asp Leu His Leu 
315 320 

Leu Asn Ala Val Phe His 
335 



atgtcggaga 


gcaaccgccg 


ccgcccacag 


cttcctgcaa 


aaatgaccac 


catcaccttc 


900 


gcaccgcctc 


ctcctatgaa 


ggaggtgcgg 


gcgcctctgc 


ggtagctggg 


gctcagccag 


960 


ccccgggagg 


gggcccgaga 


cctaccccgc 


ccgggccttg 


ccggactcgc 


ttctcaggcc 


1020 


cagcttcctc 


tgggggttac 


tcggtgcagc 


tggacgcttt 


gcacttccca 


gccttcgagg 


1080 


acagggccct 


aacagcgatg 


atggcacaga 


aggacctgtc 


gccttcgctg 


cccgggggca 


1140 


tcctgcagcc 


tgtacaggag 


gacaacagca 


aaacatctcc 


cagatcccac 


agaagtgacg 


1200 


gagagttttc 


cccccactct 


cattacagtg 


actctgatga 


ggccagttag 


gaaggcaaca 


1260 


gctccctgaa 


aactgagaca 


accaaatgcc 


cttcctagcg 


cgcgggaagc 


cccgtgacaa 


1320 


atatccctgc 


accctttaat 


ttttggtctg 


tggtgatcgt 


tgttagcaac 


gacttgactt 


1380 


cggacggctg 


cag 










1393 



<210> 46 

<211> 351 

<212> PRT 

<213> MOUSE 

<400> 46 

Met Ser Arg Leu Leu His Ala Glu 
1 5 

Gly Asp His His Arg His Pro Gin 
20 

Pro Gin Pro Pro Ala Thr Leu Gin 
35 40 

Ala Glu Leu Ser Leu Leu Asp Ser 
50 55 

Pro Thr Leu Gin Gly Leu Cys Thr 
65 70 

His Ser Pro Glu Leu Gly Ala Ser 
85 

Ala Asp Ser Gin Gly Glu Leu Val 
100 

Ser Lys Ser Pro Gly Pro Val Lys 
115 120 

Lys Gly Gly Val Val Val Asp Glu 
130 135 

Pro Ser Ser Lys Gin Val Asn Gly 
145 150 

Ala Asn Ala Arg Glu Arg Arg Arg 
165 



Glu Trp Ala Glu Val Lys Glu Leu 
10 15 

Pro His His Val Pro Pro Leu Thr 
25 30 

Ala Arg Asp Leu Pro Val Tyr Pro 
45 

Thr Asp Pro Arg Ala Trp Leu Thr 
60 

Ala Arg Ala Ala Gin Tyr Leu Leu 
75 80 

Glu Ala Ala Ala Pro Arg Asp Glu 
90 95 

Arg Arg Ser Gly Cys Gly Gly Leu 
105 110 

Val Arg Glu Gin Leu Cys Lys Leu 
125 

Leu Gly Cys Ser Arg Gin Arg Ala 

140 

Val Gin Lys Gin Arg Arg Leu Ala 
155 160 



Met His Gly Leu Asn His Ala Phe 
170 175 



Asp Gin Leu Arg Asn 
180 

Ser Lys Tyr Glu Thr 
195 

Ser Glu Leu Leu Gin 
210 

Thr Ala Ser Cys Lys 
225 

Tyr Glu Gly Gly Ala 
245 

Pro Gly Gly Gly Pro 
260 

Phe Ser Gly Pro Ala 
275 

Leu His Phe Pro Ala 
290 

Gin Lys Asp Leu Ser 
305 

Gin Glu Asp Asn Ser 

325 

Glu Phe Ser Pro His 
340 

<210> 47 
<211> 993 
<212> DNA 
<213> MOUSE 

<400> 47 



atggcaaaaa 


tgtatatgaa 


atccaaggac 


atggtggagc 


tggtcaacac 


acaatcctgg 


60 


atggacaaag 


gtctgagctc 


tcaaaatgag 


atgaaggagc 


aagagagaag 


accgggctct 


120 


tatggaatgc 


tcggaacctt 


aactgaagag 


catgacagta 


ttgaggagga 


tgaagaagag 


180 


gaagaagatg 


gagataaacc 


taaaagaaga 


ggtcccaaga 


aaaagaagat 


gactaaagct 


240 


cgccttgaaa 


gattcagggc 


tcgaagagtc 


aaggccaatg 


ctagagaacg 


gacccggatg 


300 


catggcctga 


atgatgcctt 


ggataatctt 


aggagagtca 


tgccatgtta 


ctctaaaact 


360 


caaaagcttt 


ccaagataga 


gactcttcga 


ctggcaagga 


actacatctg 


ggccttgtct 


420 


gaagtcctgg 


agactggtca 


gacacttgaa 


gggaagggat 


ttgtagagat 


gctatgtaaa 


480 


ggtctctctc 


aacccacaag 


caacctggtt 


gctggatgcc 


tccaactggg 


gcctcaatct 


540 


accctcctgg 


agaagcatga 


ggaaaaatct 


tcaatttgtg 


actctactat 


ctctgtccac 


600 


agcttcaact 


atcagtctcc 


agggctcccc 


agccctcctt 


atggccatat 


ggaaacacat 


660 



Val lie Pro Ser Phe Asn Asn Asp Lys Lys Leu 
185 190 

Leu Gin Met Ala Gin lie Tyr lie Asn Ala Leu 
200 205 

Thr Pro Asn Val Gly Glu Gin Pro Pro Pro Pro 
215 220 

Asn Asp His His His Leu Arg Thr Ala Ser Ser 
230 235 240 

Gly Ala Ser Ala Val Ala Gly Ala Gin Pro Ala 
250 255 

Arg Pro Thr Pro Pro Gly Pro Cys Arg Thr Arg 
265 270 

Ser Ser Gly Gly Tyr Ser Val Gin Leu Asp Ala 
280 285 

Phe Glu Asp Arg Ala Leu Thr Ala Met Met Ala 
295 300 

Pro Ser Leu Pro Gly Gly lie Leu Gin Pro Val 
310 315 320 

Lys Thr Ser Pro Arg Ser His Arg Ser Asp Gly 
330 335 



Ser His Tyr Ser Asp Ser Asp Glu Ala Ser 
345 350 



840 
900 
960 
993 



tctctccatc tcaagcctca accatttaag agtttgggtg actcttttgg gagccatcca 720 
cctgactgca gtaccccccc ttatgagggt ccactcacac cacccctgag cattagtggc 7 80 
aacttctcct taaagcaaga cggctcccct gatttggaaa aatcctacaa tttcatgcca 
cattatacct ctgcaagtct aagttcaggg catgtgcatt caactccctt tcagactggc 
actccccgct atgatgttcc tgtagacctg agctatgatt cctactccca ccatagcatt 
ggaactcagc tcaatacgat cttctctgat tag 



<210> 48 

<211> 330 

<212> PRT 

<213> MOUSE 

<400> 48 

Met Ala Lys Met Tyr Met Lys Ser Lys Asp Met Val Glu Leu Val Asn 
15 10 15 

Thr Gin Ser Trp Met Asp Lys Gly Leu Ser Ser Gin Asn Glu Met Lys 
20 25 30 

Glu Gin Glu Arg Arg Pro Gly Ser Tyr Gly Met Leu Gly Thr Leu Thr 
35 40 45 

Glu Glu His Asp Ser He Glu Glu Asp Glu Glu Glu Glu Glu Asp Gly 
50 55 60 

Asp Lys Pro Lys Arg Arg Gly Pro Lys Lys Lys Lys Met Thr Lys Ala 
65 70 75 80 

Arg Leu Glu Arg Phe Arg Ala Arg Arg Val Lys Ala Asn Ala Arg Glu 
85 90 95 

Arg Thr Arg Met His Gly Leu Asn Asp Ala Leu Asp Asn Leu Arg Arg 
100 105 HO 

Val Met Pro Cys Tyr Ser Lys Thr Gin Lys Leu Ser Lys He Glu Thr 
115 120 125 

Leu Arg Leu Ala Arg Asn Tyr He Trp Ala Leu Ser Glu Val Leu Glu 
130 135 140 

Thr Gly Gin Thr Leu Glu Gly Lys Gly Phe Val Glu Met Leu Cys Lys 
145 150 155 160 

Gly Leu Ser Gin Pro Thr Ser Asn Leu Val Ala Gly Cys Leu Gin Leu 
165 170 175 

Gly Pro Gin Ser Thr Leu Leu Glu Lys His Glu Glu Lys Ser Ser He 
180 185 190 

Cys Asp Ser Thr He Ser Val His Ser Phe Asn Tyr Gin Ser Pro Gly 
195 200 205 

Leu Pro Ser Pro Pro Tyr Gly His Met Glu Thr His Ser Leu His Leu 
210 215 220 



Lys Pro Gin Pro Phe Lys Ser Leu Gly Asp Ser Phe Gly Ser His Pro 
225 230 235 240 

Pro Asp Cys Ser Thr Pro Pro Tyr Glu Gly Pro Leu Thr Pro Pro Leu 
245 250 255 

Ser lie Ser Gly Asn Phe Ser Leu Lys Gin Asp Gly Ser Pro Asp Leu 
260 265 270 

Glu Lys Ser Tyr Asn Phe Met Pro His Tyr Thr Ser Ala Ser Leu Ser 
275 280 285 

Ser Gly His Val His Ser Thr Pro Phe Gin Thr Gly Thr Pro Arg Tyr 
290 295 300 

Asp Val Pro Val Asp Leu Ser Tyr Asp Ser Tyr Ser His His Ser lie 
305 310 315 320 

Gly Thr Gin Leu Asn Thr lie Phe Ser Asp 
325 330 

<210> 49 
<211> 2264 
<212> DNA 
<213> FROG 



<400> 49 
gacagtcagt 


agatgctcct 


ttcccaaagt 


gtatcttgca 


gtagccttac 


actaatactg 


60 


cctgtcttgt 


agctggagag 


ctttgtatcg 


atgctaagag 


cactgggtat 


ctacaaagag 


120 


gatcactgca 


tatgaatgga 


ataaggagtg 


ctgctgctac 


ccaggctggg 


gtttgttccg 


180 


agcccttcaa 


aaccttttgg 


ccatagaatc 


actgtgttga 


catgaagtca 


gattcaccag 


240 


tgcatgggga 


gtcccatact 


gaatgccagt 


caccatgccc 


actaagttgc 


atgccagcca 


300 


ggctggaagg 


ctctaccaag 


agacgtctgg 


ctgccaatgc 


cagggaaaga 


aggagaatgc 


360 


aaggactgaa 


taccgccttc 


gatagtctga 


ggaaagttgt 


accgcaatgg 


ggtgaggaca 


420 


aaaaactttc 


caagtatgag 


actctacaga 


tggcactgag 


ctacatcatg 


gcactaagca 


480 


ggatcctcac 


ggaagcagaa 


agatacagca 


gaactgatcc 


aggggaatgg 


actaaaatgc 


540 


actttgatca 


cattcaggaa 


gaacagtgcc 


tcagttatat 


gggagtgaga 


tgcccaagag 


600 


actgtgatcg 


ctacctgccc 


cagacttttt 


ctcactagga 


taggagatgt 


gagcaacagt 


660 


cagcaggcaa 


ggtactatag 


acctgaagat 


agcagtgtat 


tcctacacac 


agcagccaat 


720 


aatacaggga 


catttgcatc 


atgggttatt 


tgtcatgtca 


ttctgcccaa 


tgcactgctt 


780 


atttcattaa 


gcacccaaag 


tcccaggact 


gggaatatat 


gtagggcacc 


ccacgtgatg 


840 


cagcccaaag 


tatgtgctgc 


tgcaactgat 


agtgagctgt 


gggacactgg 


aaaagcaaag 


900 


tgcgctggta 


ttttgtaaat 


gaaaatgtca 


ttatgggtgg 


catataataa 


ttacttacac 


960 


acagcacagt 


tatataattt 


cattgctgtt 


agaaagcccc 


tttgtctctt 


accccccatc 


1020 



ctgctttcca tgttactaat gcactggtat cagtgaacat gtagaaggta gtatttatac 1080 

attagaaaaa cactgaatgt gcataaacat catttttctc aatatacatt tactatatag 1140 

ttcagttggt aagtgctggt aaacagcagc tgcccacaag ttctgtgcac tgctcatagg 1200 

ggaaaggaaa catttttgcc acttgctaga gaatgctaat tgcattgtac tctgtaatat 1260 

tgtcttgtgt ccagggtaag aaaggtctaa attaaaatca ataggaggca taatttacac 1320 

atttccctta agctgtaatt ttctggcttt tgtctgcatt ttaatagtgg aaaattacaa 1380 

cgtgtataaa attcctactg gtctcattct acttgtttct gtaagacaaa ccctggtgtg 1440 

tttttaagga aaccttaagt aattcttttg aatagtatac atttcttgaa tgtttttttt 1500 

ttttttttca atgggcatga gttgtttatt tgtaaacaca gctccaaagt taacagattg 1560 

cttgtatctt tcaggtatgc aaatggtatt ttagtctatt ctgggacttt gaaatagaaa 1620 

gcaatatgca accctttaga attagaaaat gtagtacaac ggaacaaatg gtgaatggct 1680 

tggcagagta ctgtacatac tgacctatgg agcatagttg gctaaatcag tctgcagttc 1740 

acttttactg gtgtgttgca gttagaataa taaaatcgga tgttgctaag gttgttacag 1800 

tatttctttg ccccagttaa tgcatgacct cccatcttac agagtttcta acacgttgcc 1860 

aatgtattgt tatctttcga aaaatgtgga tggaacgtga tcaggtgcaa attcatgcga 1920 

tttacctgtg cataattcct ataaaacaga gacaatgtgt agttatagga ggattcacta 1980 

cacggatgaa agatcttatt tacaacaggt taggcacaaa aagtgcagga ccatgaaaaa 2040 

ttgtggccct ttagatttac taaattctgg cacccagcca ttgtgatgct gcacctgcac 2100 

tcagcaccat attataatcc agcacaaggc agtaagtata aggctcccat actcttctca 2160 

ctgataagct tgctagagat gttcccaagg aagcccatgt gctgcccatc ttctgccctt 2220 

cctctaggca gcactgcctc acatgcggaa tgaaggccca agcg 2264 

<210> 50 
<211> 138 
<212> PRT 
<213> FROG 

<400> 50 

Met Lys Ser Asp Ser Pro Val His Gly Glu Ser His Thr Glu Cys Gin 



Ser Pro Cys Pro Leu Ser Cys Met Pro Ala Arg Leu Glu Gly Ser Thr 
20 25 30 

Lys Arg Arg Leu Ala Ala Asn Ala Arg Glu Arg Arg Arg Met Gin Gly 
35 40 45 

Leu Asn Thr Ala Phe Asp Ser Leu Arg Lys Val Val Pro Gin Trp Gly 



50 55 60 

Glu Asp Lys Lys Leu Ser Lys Tyr Glu Thr Leu Gin Met Ala Leu Ser 
65 70 75 80 

Tyr lie Met Ala Leu Ser Arg lie Leu Thr Glu Ala Glu Arg Tyr Ser 

85 90 95 

Arg Thr Asp Pro Gly Glu Trp Thr Lys Met His Phe Asp His lie Gin 
100 105 110 

Glu Glu Gin Cys Leu Ser Tyr Met Gly Val Arg Cys Pro Arg Asp Cys 
115 120 125 

Asp Arg Tyr Leu Pro Gin Thr Phe Ser His 
130 135 

<210> 51 

<211> 2123 

<212> DNA 

<213> FROG 



<400> 51 



gacagtcagt 


agatgctcct 


tgcctgaagt 


gtatcttgca 


gtagccttac 


actgccttgt 


60 


agctggagag 


ccttgtatca 


ctgctaacaa 


cactgggtat 


caacaaagaa 


gacaactgca 


120 


taaaaatgaa 


ataaggagat 


actacccagg 


ctggggttta 


ttttgagtct 


ttcagaactt 


180 


tctagggata 


gaatatctct 


gctgacatga 


agtcagattc 


accagtgcac 


agggagtccc 


240 


atactggatg 


ccagtcacca 


tgcccactaa 


ggtgcttgcc 


agccaggctg 


gaaggctcta 


300 


ccaagagacg 


tctggctgcc 


aatgccagag 


aaaggaggag 


aatgcaagga 


ctaaataccg 


360 


ccttcgatag 


tctgaggaaa 


gttgtaccac 


aatggggtga 


agacaaacaa 


ctttccaaat 


420 


atgagactct 


gcagatggcg 


ctgagctaca 


tcatggcact 


gagcaggatc 


ctctcggaag 


480 


cagagaggta 


cagcaggact 


gatccagagg 


aatggactaa 


tattcaatat 


gatcacattg 


540 


aggaggagca 


gtgcctcagt 


tatatggagg 


tgagatgccc 


aagagactgt 


gatcgttacc 


600 


tgccccagac 


tttttctcac 


taggataaga 


gcaggcaagg 


tactactgac 


ctgaagacag 


660 


cactgtttta 


atataatggg 


tcggttatac 


agcacccaat 


gatacaggga 


catttgcatc 


720 


atgggctatt 


tgtcatgttg 


tttttcccaa 


tgcaatgctt 


atttccttaa 


gcaccctacg 


780 


ttcaggactg 


ggtacatatg 


tagggaaccc 


caagtgatgc 


agcccagagg 


atgcggtgct 


840 


gcaacggatg 


gcagttagtg 


agctgtggaa 


cactggaaaa 


gccaagtgca 


ctggtatttt 


900 


gtgaaaggac 


atgcaagtta 


ttatgggtgg 


catataatat 


ttacctccat 


acagcacagt 


960 


gatataactt 


cattgcctcc 


atcatacctg 


tgtgattata 


tataaaatgg 


tagttcctga 


1020 


gtcactactt 


tccatgttac 


ttatgcactg 


ttatcagata 


acatagagaa 


agtagtattt 


1080 


atacattaga 


aaaagtacta 


tatgtgcata 


taaattggcg 


tttaaagcag 


tctggataaa 


1140 



tcattttctg 


tggactcttt 


tcttacccca 


aggagccatt 


tatagtttaa 


gtgctgctat 


1200 


aggcactgct 


cataagggaa 


aggaaacatt 


tttgtcactt 


gctgataata 


caaattgcat 


1260 


t ctacaccag 


aattcttaat 


taactatact 


gtctagtgaa 


cagaaaggtc 


taaattaaaa 


1320 


t caacaagag 


gtataattta 


catattttac 


cattttctgg 


cttctgtctg 


ccttttgaga 


1380 


gtggcaaatt 


acaacgtata 


taaaattcct 


acaggtctca 


atctacttgt 


tgctgtaaag 


1440 


a caaaccttg 


gtgtattttt 


aaggaaacct 


taagtaattc 


ttttggttag 


tatacatttc 


1500 


ttaaatattt 


ttatttgtaa 


tgggcgtcaa 


ttgtttattt 


gtaaacgcag 


ctccaaaaat 


1560 


agcagatatg 


cttgtatttt 


tcaggtatgc 


aaatggtatt 


tcagcctata 


ttgggaaagc 


1620 


agtaagcaaa 


tctttgtaat 


tgaaattgta 


gtaaaactga 


acatatggtg 


aattgcttgg 


1680 


caatgtactg 


tatacatgct 


gacctataag 


ggctatgtag 


caaagttggg 


aaaaagtttt 


1740 


ccctatcaaa 


atcactgcag 


gtttattggt 


ttgttgcaga 


tagaataata 


aaatctgaca 


1800 


ttgctgatgt 


gggtaaggta 


ttttattccc 


catttaatat 


cttaatcttt 


cttcttacac 


1860 


tgaaaacatt 


actagcccct 


tgccaatgta 


ttcttatctg 


tcaaattcag 


gagttcatga 


1920 




L. cl ^ ci Lad '■w 


L. CI L. Vj CL d G. ^ ^ 




crt~ nrrnTT-H f\ 1~ p\ 


Pi HPiHCfPl'\"t"f~CT 


1980 


ctgcacaaat 


tagggcagag 


atacatggtc 


tgactgtgac 


aaatctcctc 


ttcttcggag 


2040 


cgtttaatct 


ccccaaactg 


cctttccagg 


cactttgttc 


tccgaagtcg 


cccgaagttg 


2100 


cctaacgagg 


caacttcagc 


acg 








2123 



<210> 52 

<211> 138 

<212> PRT 

<213> FROG 

<400> 52 

Met Lys Ser Asp Ser Pro Val His Arg Glu Ser His Thr Gly Cys Gin 
15 10 15 

Ser Pro Cys Pro Leu Arg Cys Leu Pro Ala Arg Leu Glu Gly Ser Thr 
20 25 30 

Lys Arg Arg Leu Ala Ala Asn Ala Arg Glu Arg Arg Arg Met Gin Gly 
35 40 45 

Leu Asn Thr Ala Phe Asp Ser Leu Arg Lys Val Val Pro Gin Trp Gly 
50 55 60 

Glu Asp Lys Gin Leu Ser Lys Tyr Glu Thr Leu Gin Met Ala Leu Ser 
65 70 75 80 

Tyr lie Met Ala Leu Ser Arg lie Leu Ser Glu Ala Glu Arg Tyr Ser 
85 90 95 

Arg Thr Asp Pro Glu Glu Trp Thr Asn lie Gin Tyr Asp His lie Glu 



100 



105 



110 



Glu Glu Gin Cys Leu Ser Tyr Met Glu Val Arg Cys Pro Arg Asp Cys 
115 120 125 

Asp Arg Tyr Leu Pro Gin Thr Phe Ser His 
130 135 

<210> 53 

<211> 136 

<212> DNA 

<213> BEETLE 

<400> 53 

gcggcgaatg cgcgcgagcg gcggcggatg aacggcctga atgaagcttt cgatcggcta 60 

agacaagtta taccaagctt ggacgctgac cacaaattga gcaagtttga gactctgcag 120 

atggcccaga cctaca 136 



<210> 54 

<211> 45 

<212> PRT 

<213> BEETLE 

<400> 54 

Ala Ala Asn Ala Arg Glu Arg Arg Arg Met Asn Gly Leu Asn Glu Ala 
15 10 15 

Phe Asp Arg Leu Arg Gin Val lie Pro Ser Leu Asp Ala Asp His Lys 
20 25 30 

Leu Ser Lys Phe Glu Thr Leu Gin Met Ala Gin Thr Tyr 
35 40 45 

<210> 55 

<211> 137 

<212> DNA 

<213> BEETLE 

<400> 55 

gcggcgaatg cgagggagag gcggaggatg aacagtttga atgacgcctt cgacaggctg 60 

cgggacgtgg tgccgtccct tgggaacgat cggaagctgt ccaagtttga gacacttcag 120 

atggcccaga cctacat 137 



<210> 56 

<211> 45 

<212> PRT 

<213> BEETLE 

<400> 56 

Ala Ala Asn Ala Arg Glu Arg Arg Arg Met Asn Ser Leu Asn Asp Ala 
15 10 15 



Phe Asp Arg Leu Arg Asp Val Val Pro Ser Leu Gly Asn Asp Arg Lys 



20 



25 



30 



Leu Ser Lys Phe Glu Thr Leu Gin Met Ala Gin Thr Tyr 
35 40 45 

<210> 57 

<211> 1572 

<212> DNA 

<213> HUMAN 

<220> 

<221> misc_feature 

<222> (1497) . . (1497) 

<223> rr n" can be any nucleotide 



<220> 

<221> misc__f eature 

<222> (1504) . . (1504) 

<223> "n" can be any nucleotide 



<220> 

<221> misc__f eature 

<222> (1526) . . (1526) 

<223> "n" can be any nucleotide 



<220> 

<221> mis cofeature 

<222> (1564) . . (1564) 

<223> "n" can be any nucleotide 

<400> 57 

gtcctctgca cacaagaact tttctcgggg tgtaaaaact ctttgattgg ctgctcgcac 60 

gcgcctgccc gcgccctcca ttggctgaga agacacgcga ccggcgcgag gagggggttg 120 

ggagaggagc ggggggagac tgagtggcgc gtgccgcttt ttaaaggggc gcagcgcctt 180 

cagcaaccgg agaagcatag ttgcacgcga cctggtgtgt gatctccgag tgggtggggg 24 0 

agggtcgagg agggaaaaaa aaataagacg ttgcagaaga gacccggaaa gggccttttt 300 

tttggttgag ctggtgtccc agtgctgcct ccgatcctga gcgtccgagc ctttgcagtg 360 

caatgtcccg cctgctgcat gcagaagagt gggctgaagt gaaggagttg ggagaccacc 4 20 

atcgccagcc ccagccgcat catctcccgc aaccgccgcc gccgccgcag ccacctgcaa 480 

ctttgcaggc gagagagcat cccgtctacc cgcctgagct gtccctcctg gacagcaccg 54 0 

acccacgcgc ctggctggct cccactttgc agggcatctg cacggcacgc gccgcccagt 600 

atttgctaca ttccccggag ctgggtgcct cagaggccgc tgcgccccgg gacgaggtgg 660 

acggccgggg ggagctggta aggaggagca gcggcggtgc cagcagcagc aagagccccg 72 0 

ggccggtgaa agtgcgggaa cagctgtgca agctgaaagg cggggtggtg gtagacgagc 780 



tgggctgcag 


ccgccaacgg 


gccccttcca 


gcaaacaggt 


gaatggggtg 


cagaagcaga 


840 


gacggctagc 


agccaacgcc 


agggagcggc 


gcaggatgca 


tgggctgaac 


cacgccttcg 


900 


accagctgcg 


caatgttatc 


ccgtcgttca 


acaacgacaa 


gaagctgtcc 


aaatatgaga 


960 


ccctgcagat 


ggcccaaatc 


tacatcaacg 


ccttgtccga 


gctgctacaa 


acgcccagcg 


1020 


gaggggaaca 


gccaccgccg 


cctccagcct 


cctgcaaaag 


cgaccaccac 


caccttcgca 


1080 


ccgcggcctc 


ctatgaaggg 


ggcgcgggca 


acgcgaccgc 


agctggggct 


cagcaggctt 


1140 


ccggagggag 


ccagcggccg 


accccgcccg 


ggagttgccg 


gactcgcttc 


tcagccccag 


1200 


cttctgcggg 


agggtactcg 


gtgcagctgg 


acgctctgca 


cttctcgact 


ttcgaggaca 


1260 


gcgccctgac 


agcgatgatg 


gcgcaaaaga 


atttgtctcc 


ttctctcccc 


gggagcatct 


1320 


tgcagccagt 


gcaggaggaa 


aacagcaaaa 


cttcgcctcg 


gtcccacaga 


agcgacgggg 


1380 


aattttcccc 


ccattcccat 


tacagtgact 


cqqatgaggc 


aagttaggaa 


ggtgacagaa 


1440 


gcctgaaaac 


tgagacagaa 


acaaaactgc 


cctttcccag 


tgcgcgggaa 


gccccgnggt 


1500 


taangatccc 


cgcacccttt 


aatttnggct 


ctgcgatggt 


cgttgtttag 


caacgacttg 


1560 


gctncagatg 


gt 










1572 



<210> 58 

<211> 354 

<212> PRT 

<213> HUMAN 

<400> 58 

Met Ser Arg Leu Leu His Ala Glu Glu Trp Ala Glu Val Lys Glu Leu 
15 10 15 

Gly Asp His His Arg Gin Pro Gin Pro His His Leu Pro Gin Pro Pro 
20 25 30 

Pro Pro Pro Gin Pro Pro Ala Thr Leu Gin Ala Arg Glu His Pro Val 
35 40 45 

Tyr Pro Pro Glu Leu Ser Leu Leu Asp Ser Thr Asp Pro Arg Ala Trp 
50 55 60 

Leu Ala Pro Thr Leu Gin Gly lie Cys Thr Ala Arg Ala Ala Gin Tyr 
65 70 75 80 

Leu Leu His Ser Pro Glu Leu Gly Ala Ser Glu Ala Ala Ala Pro Arg 
85 90 95 

Asp Glu Val Asp Gly Arg Gly Glu Leu Val Arg Arg Ser Ser Gly Gly 
100 105 110 

Ala Ser Ser Ser Lys Ser Pro Gly Pro Val Lys Val Arg Glu Gin Leu 
115 120 125 

Cys Lys Leu Lys Gly Gly Val Val Val Asp Glu Leu Gly Cys Ser Arg 



130 



135 



140 



Gin Arg Ala Pro Ser Ser Lys Gin Val Asn Gly Val Gin Lys Gin Arg 
145 150 155 160 

Arg Leu Ala Ala Asn Ala Arg Glu Arg Arg Arg Met His Gly Leu Asn 
165 170 175 

His Ala Phe Asp Gin Leu Arg Asn Val lie Pro Ser Phe Asn Asn Asp 
180 185 190 

Lys Lys Leu Ser Lys Tyr Glu Thr Leu Gin Met Ala Gin lie Tyr lie 
195 200 205 

Asn Ala Leu Ser Glu Leu Leu Gin Thr Pro Ser Gly Gly Glu Gin Pro 
210 215 220 

Pro Pro Pro Pro Ala Ser Cys Lys Ser Asp His His His Leu Arg Thr 
225 230 235 240 

Ala Ala Ser Tyr Glu Gly Gly Ala Gly Asn Ala Thr Ala Ala Gly Ala 
245 250 255 

Gin Gin Ala Ser Gly Gly Ser Gin Arg Pro Thr Pro Pro Gly Ser Cys 
260 265 270 

Arg Thr Arg Phe Ser Ala Pro Ala Ser Ala Gly Gly Tyr Ser Val Gin 
275 280 285 

Leu Asp Ala Leu His Phe Ser Thr Phe Glu Asp Ser Ala Leu Thr Ala 
290 295 300 

Met Met Ala Gin Lys Asn Leu Ser Pro Ser Leu Pro Gly Ser lie Leu 
305 310 315 320 

Gin Pro Val Gin Glu Glu Asn Ser Lys Thr Ser Pro Arg Ser His Arg 
325 330 335 

Ser Asp Gly Glu Phe Ser Pro His Ser His Tyr Ser Asp Ser Asp Glu 
340 345 350 



Ala Ser 



<210> 59 

<211> 485 

<212> DNA 

<213> CHICKEN 

<220> 

<221> misc_f eature 

<222> (147) . . (147) 

<223> "n" can be any nucleotide 



<400> 59 

ccgctgctgg ggccggacgg ggcggctgcg gcttcgcccc cggctggctg ggcgtgtgct 



gcgccgcacg cgtgcccgcc gcgtcgccgc gctacctgct gcccgccgac gaggaggacg 
aggcggcccg tggcgggggg cgcggcncgc gttccggcgg gagcagcccc gggggagcgc 



60 
120 
180 



g^ggcggcgg C( 3 ( 3 ( 3 C( 3 C 9 0 9 gggcggcggc ggcggggccg ggccgcgggc gcaggtgagc 24 0 

ggcgtgcaga agcagcggcg gctggcggcc aacgcgcggg agcggcggcg gatgcacggg 300 

ctgaaccacg ccttcgacca gctgcgtaat gtcatcccct ccttcaacaa cgacaagaag 360 

ctctccaagt acgagacgct gcagatggcg caaatctaca tcagcgccct cgccgagctg 420 

ctgcacgggc cgcccgcgcc ccccgagccg cccgccaagg ccgagctccg cggggccccc 4 80 

ttcga 485 



<210> 60 

<211> 161 

<212> PRT 

<213> CHICKEN 

<400> 60 

Pro Leu Leu Gly Pro Asp Gly Ala Ala Ala Ala Ser Pro Pro Ala Gly 
15 10 15 

Trp Ala Cys Ala Ala Pro His Ala Cys Pro Pro Arg Arg Arg Ala Thr 
20 25 30 

Cys Cys Pro Pro Thr Arg Arg Thr Arg Arg Pro Val Ala Gly Gly Ala 
35 40 45 

Ala Arg Val Pro Ala Gly Ala Ala Pro Gly Glu Arg Gly Ala Ala Ala 
50 55 60 

Gly Ala Arg Gly Gly Gly Gly Gly Ala Gly Pro Arg Ala Gin Val Ser 
65 70 75 80 

Gly Val Gin Lys Gin Arg Arg Leu Ala Ala Asn Ala Arg Glu Arg Arg 
85 90 95 

Arg Met His Gly Leu Asn His Ala Phe Asp Gin Leu Arg Asn Val lie 
100 105 110 

Pro Ser Phe Asn Asn Asp Lys Lys Leu Ser Lys Tyr Glu Thr Leu Gin 
115 120 125 

Met Ala Gin He Tyr He Ser Ala Leu Ala Glu Leu Leu His Gly Pro 
130 135 140 

Pro Ala Pro Pro Glu Pro Pro Ala Lys Ala Glu Leu Arg Gly Ala Pro 
145 150 155 160 

Phe 



<210> 61 

<211> 138 

<212> DNA 

<213> PUFFER FISH 



<400> 61 

gcggcgaacg cgagggagag gaggagaatg cacggcctga ataaagcgtt tgacgaactg 



60 



aggagcgtca ttccttccct ggaaaatgag agaaagctct ccaagtatga cactctccag 
atggcccaaa cctacatc 



120 
138 



<210> 62 

<211> 46 

<212> PRT 

<213> PUFFER FISH 

<4O0> 62 

Ala Ala Asn Ala Arg Glu Arg Arg Arg Met His Gly Leu Asn Lys Ala 
15 10 15 

Phe Asp Glu Leu Arg Ser Val lie Pro Ser Leu Glu Asn Glu Arg Lys 
20 25 30 

Leu Ser Lys Tyr Asp Thr Leu Gin Met Ala Gin Thr Tyr lie 
35 40 45 

<210> 63 

<211> 1477 

<212> DNA 

<213> DROSOPHILA 



<400> 63 
atcatcttgt 


tagcggcttt 


agagccgaat 


cgttttctag 


cgccatttta 


agctcgcaac 


gaactgaggt 


ataaccgggc 


tctctgagac 


cgctgcaact 


caccaccaac 


tgccattggt 


cgtgccactc 


gggcggcacg 


tgctgccttc 


tgtggcaact 


cgtttacctg 


cccccctacc 


tgcctttcag 


gcccttctga 


ccgtcgtggt 


ggatttgtga 


gtataaatag 


ggccgaaagg 


acgagagacc 


agtcagaaac 


ccgccagcac 


tcgcagcgtt 


cgtatcgttt 


catccagcaa 


cataacacca 


ccatacagca 


gcagcaacat 


gtcgtccagt 


gagatctatc 


gctactacta 


caagacctcc 


gaggacttgc 


agggcttcaa 


gacagccgcc 


gccgagccgt 


acttcaatcc 


catggcagcc 


tacaatcccg 


gcgtgaccca 


ctaccagttc 


aatggcaaca 


ccctggccag 


cagcagcaac 


tacttgtcgg 


ccaatggctt 


catcagcttc 


gagcaggcca 


gttccgatgg 


ctggatctcc 


tcctcgccgg 


ctagccaccg 


atctgagagt 


cccgagtatg 


tggatctcaa 


taccatgtac 


aatggaggct 


gcaacaacat 


ggcccagaac 


caacaatacg 


gaatgattat 


ggagcagtct 


gttgtttcca 


cagcgcctgc 


aattccagtg 


gcctctcctc 


cggcagtgga 


ggtcatgggc 


tcctccaacg 


tgggcacttg 


caaaacgatt 


ccagcctcag 


cagctccgaa 


accgaagcgt 


agctatacca 


agaagaacca 


gccaagcacc 


accgccacct 


ccacaccgac 


tgcagctgcg 


gagtcatctg 


cctcagtgaa 


tctctacacg 


gaggagttcc 


agaactttga 


ctttgacaac 


tccgccttgt 


tcgatgacag 


cgtcgaggat 


gacgaggacc 


tcatgctctt 


cagtggcggt 


gaggacttcg 


atggcaatga 


tggatccttt 


gacttggccg 


atggtgagaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 



1477 



ccaagatgcc gctgccggag gctctggaaa gaagaggcgt ggcaagcaga tcacacccgt 108 0 

cgtgaagagg aagcgtcgcc tggccgccaa tgcacgtgag cgtcgtcgga tgcagaacct 1140 

■caaccaggcc ttcgatcgtc tccgccagta ccttccctgt ctgggaaacg atcgccagct 1200 

gtccaaacac gagaccctcc aaatggccca gacctacata tccgctctcg gggatctgct 12 60 

gcgctgaatt cccggatccc gatcccagtc ccaagtacta ttctcagtta ttgttggagc 1320 

ttgccaaatg ttgtagctac tttgtatata ttgcctggag cccagtagtg aattaccgct 1380 

taagtattat gctgtttatt gtttagttaa ttagcctaaa tggaagacaa tgattaagac 1440 
taaggaagac aaaataaaag caccattaat aatttaa 

<210> 64 

<211> 312 

<212> PRT 

<213> DROSOPHILA 

<400> 64 

Met Ser Ser Ser Glu He Tyr Arg Tyr Tyr Tyr Lys Thr Ser Glu Asp 
15 10 15 

Leu Gin Gly Phe Lys Thr Ala Ala Ala Glu Pro Tyr Phe Asn Pro Met 
20 25 30 

Ala Ala Tyr Asn Pro Gly Val Thr His Tyr Gin Phe Asn Gly Asn Thr 
35 40 45 

Leu Ala Ser Ser Ser Asn Tyr Leu Ser Ala Asn Gly Phe He Ser Phe 
50 55 60 

Glu Gin Ala Ser Ser Asp Gly Trp He Ser Ser Ser Pro Ala Ser His 
65 70 75 80 

Arq Ser Glu Ser Pro Glu Tyr Val Asp Leu Asn Thr Met Tyr Asn Gly 
85 90 95 

Gly Cys Asn Asn Met Ala Gin Asn Gin Gin Tyr Gly Met He Met Glu 
100 105 HO 

Gin Ser Val Val Ser Thr Ala Pro Ala He Pro Val Ala Ser Pro Pro 
115 120 125 

Ala Val Glu Val Met Gly Ser Ser Asn Val Gly Thr Cys Lys Thr He 
130 135 140 

Pro Ala Ser Ala Ala Pro Lys Pro Lys Arg Ser Tyr Thr Lys Lys Asn 
145 150 155 160 

Gin Pro Ser Thr Thr Ala Thr Ser Thr Pro Thr Ala Ala Ala Glu Ser 
165 170 175 

Ser Ala Ser Val Asn Leu Tyr Thr Glu Glu Phe Gin Asn Phe Asp Phe 
180 185 190 



Asp Asn Ser Ala Leu Phe Asp Asp Ser Val Glu Asp Asp Glu Asp Leu 

195 200 205 

Met Leu Phe Ser Gly Gly Glu Asp Phe Asp Gly Asn Asp Gly Ser Phe 

210 215 220 

Asp Leu Ala Asp Gly Glu Asn Gin Asp Ala Ala Ala Gly Gly Ser Gly 

225 230 235 240 

Lys Lys Arg Arg Gly Lys Gin lie Thr Pro Val Val Lys Arg Lys Arg 



Arg Leu Ala Ala Asn Ala Arg Glu Arg Arg Arg Met Gin Asn Leu Asn 
260 265 270 

Gin Ala Phe Asp Arg Leu Arg Gin Tyr Leu Pro Cys Leu Gly Asn Asp 
275 280 285 

Arg Gin Leu Ser Lys His Glu Thr Leu Gin Met Ala Gin Thr Tyr lie 
2 50 295 300 

Ser Ala Leu Gly Asp Leu Leu Arg 
305 310 

<210> 65 

<211> 907 

<212> DNA 

<213> FROG 

<400> 65 

gccccggggc cactctgcgc acttgtcggg acttattcgc acttacctgt catggcccgt 60 

ctgctacacg gcgctgctac tgccgctgac tggtgcgagc tgaaggagct tccatccgag 120 

gccgggctct tggccagaga ttacctacta gacagcagcg acccccgcgc ctggctctcc 180 

gccacttccc tgcaaagtcg ccctgagtac gtgctgcacc ccccgggccg ggccgggcgc 240 

acaaggtgcg ggaactgtgc aaactgaagg ggctgcggga tgatgatgat gatgaggagg 300 

atgatgagga ggaggaagag agatccgagg ggctgtgcag acacaggggt ccccctggca 360 

agggccctgg tggggttcag aagcagagga gactggcagc caatgccagg gagaggagga 420 

ggatgcacgg gctcaatcat gccttcgatc agctccgtaa tgtcatccct tccttcaata 480 

acgacaagaa actctccaaa tacgagaccc tgcagatggc tcagatctac atcaacgccc 540 

tgtccgacct gctgcaggcg ccccccgact ccagagatcc cccctgcccg cccacctacc 600 

aactgcattc ggggccagag cccaggttag tccagtctgg cagcatgaga ttctcggaga 660 

cttccccccg acagtccccc ctcagccaat tccaggaggg agctgctccc agaagggaat 720 

aggatctggg cccatcttca tcttctcggg aagacatcgc ccatcttcat cttcggggag 780 

aagacagcaa gacatcgcaa gatctcatcg gagtgacggc gaattccggt ctccctatag 840 

tgagtcgtat taatttcgat aagccagctg cattaatgaa tcggccaaac gcgcggggag 900 

aggcggt 907 



<210> 66 

<211> 259 

<212> PRT 

<213> FROG 

<400> 66 

Met Ala Arg Leu Leu His Gly Ala Ala Thr Ala Ala Asp Trp Cys Glu 
15 10 15 

Leu Lys Glu Leu Pro Ser Glu Ala Gly Leu Leu Ala Arg Asp Tyr Leu 
20 25 30 

Leu Asp Ser Ser Asp Pro Arg Ala Trp Leu Ser Ala Thr Ser Leu Gin 
35 40 45 

Ser Arg Pro Glu Tyr Val Leu His Pro Pro Gly Arg Ala His Lys Val 
50 55 60 

Arg Glu Leu Cys Lys Leu Lys Gly Leu Arg Asp Asp Asp Asp Asp Glu 
65 70 75 80 

Glu Glu Asp Asp Glu Glu Glu Glu Glu Arg Ser Glu Gly Leu Cys Arg 
85 90 95 

His Arg Gly Pro Pro Gly Lys Gly Pro Gly Gly Val Gin Lys Gin Arg 
100 105 110 

Arg Leu Ala Ala Asn Ala Arg Glu Arg Arg Arg Met His Gly Leu Asn 
115 120 125 

His Ala Phe Asp Gin Leu Arg Asn Val lie Pro Ser Phe Asn Asn Asp 
130 135 140 

Lys Lys Leu Ser Lys Tyr Glu Thr Leu Gin Met Ala Gin lie Tyr lie 
145 150 155 160 

Asn Ala Leu Ser Asp Leu Leu Gin Ala Pro Pro Asp Ser Arg Asp Pro 
165 170 175 

Pro Cys Pro Pro Thr Tyr Gin Leu His Ser Gly Pro Glu Pro Arg Leu 
180 185 190 

Val Gin Ser Gly Ser Cys Met Arg Phe Ser Gly Asp Phe Pro Gly Gin 
195 200 205 

Ser Pro Leu Ser Phe Gin Phe Gin Glu Gly Ala Ala Leu Ser Gly Lys 
210 215 220 

Gly lie Gly Ser Ala Pro Ser Ser Ser Ser Gly Glu Asp Ser Lys Thr 
225 230 235 240 

Ser Pro Arg Ser His Arg Ser Asp Gly Glu Phe Arg Ser Pro Tyr Ser 
245 250 255 

Glu Ser Tyr 



<210> 67 



<211> 19 
<212> DNA 

<213> SYNTHETIC CONSTRUCT 
<400> 67 

tgaagctttt ggctttgag 



<210>, 68 
<211>" 19 
<212> DNA 

<213> SYNTHETIC CONSTRUCT 
<400> 68 

ccgctgccaa attctttgg 



<210> 69 

<211> 37 

<212> DNA 

<213> HUMAN 



<400> 69 

gggggcactg acagtaatgc atgccgtatt cgaagtt 



